Primary Country (Mandatory)

Other Country (Optional)

Set News Language for United States

Primary Language (Mandatory)
Other Language[s] (Optional)
No other language available

Set News Language for World

Primary Language (Mandatory)
Other Language(s) (Optional)

Set News Source for United States

Primary Source (Mandatory)
Other Source[s] (Optional)

Set News Source for World

Primary Source (Mandatory)
Other Source(s) (Optional)
  • Countries
    • India
    • United States
    • Qatar
    • Germany
    • China
    • Canada
    • World
  • Categories
    • National
    • International
    • Business
    • Entertainment
    • Sports
    • Special
    • All Categories
  • Available Languages for United States
    • English
  • All Languages
    • English
    • Hindi
    • Arabic
    • German
    • Chinese
    • French
  • Sources
    • India
      • AajTak
      • NDTV India
      • The Hindu
      • India Today
      • Zee News
      • NDTV
      • BBC
      • The Wire
      • News18
      • News 24
      • The Quint
      • ABP News
      • Zee News
      • News 24
    • United States
      • CNN
      • Fox News
      • Al Jazeera
      • CBSN
      • NY Post
      • Voice of America
      • The New York Times
      • HuffPost
      • ABC News
      • Newsy
    • Qatar
      • Al Jazeera
      • Al Arab
      • The Peninsula
      • Gulf Times
      • Al Sharq
      • Qatar Tribune
      • Al Raya
      • Lusail
    • Germany
      • DW
      • ZDF
      • ProSieben
      • RTL
      • n-tv
      • Die Welt
      • Süddeutsche Zeitung
      • Frankfurter Rundschau
    • China
      • China Daily
      • BBC
      • The New York Times
      • Voice of America
      • Beijing Daily
      • The Epoch Times
      • Ta Kung Pao
      • Xinmin Evening News
    • Canada
      • CBC
      • Radio-Canada
      • CTV
      • TVA Nouvelles
      • Le Journal de Montréal
      • Global News
      • BNN Bloomberg
      • Métro
Researchers warn of unchecked toxicity in AI language models

Researchers warn of unchecked toxicity in AI language models

CTV
Monday, April 22, 2024 11:21:12 AM UTC

As OpenAI’s ChatGPT continues to change the game for automated text generation, researchers warn that more measures are needed to avoid dangerous responses.

As OpenAI’s ChatGPT continues to change the game for automated text generation, researchers warn that more measures are needed to avoid dangerous responses.

While advanced language models such as ChatGPT could quickly write a computer program with complex code or summarize studies with cogent synopsis, experts say these text generators are also able to provide toxic information, such as how to build a bomb.

In order to prevent these potential safety issues, companies using large language models deploy safeguard measures called “red-teaming,” where teams of human testers write prompts aimed at provoking unsafe responses, in order to trace risks and train chatbots to avoid providing those types of answers.

However, according to researchers with Massachusetts Institute of Technology (MIT), “red teaming” is only effective if engineers know which provocative responses to test.

In other words, a technology that does not rely on human cognition to function still relies on human cognition to remain safe.

Researchers from Improbable AI Lab at MIT and the MIT-IBM Watson AI Lab are deploying machine learning to fix this problem, developing a “red-team language model” specifically designed to generate problematic prompts that trigger undesirable responses from tested chatbots.

"Right now, every large language model has to undergo a very lengthy period of red-teaming to ensure its safety,” said Zhang-Wei Hong, a researcher with the Improbable AI lab and lead author of a paper on this red-teaming approach, in a press release.

Read full story on CTV
Share this story on:-
More Related News
She was a broke teenager stranded in a strange town. Then two nuns saved the day

The year was 1973 and Diann Droste was 16, on a Greyhound bus on her way home. When the bus encountered a snow storm and an unexpected detour, Diann experienced an act of kindness from strangers she’d never forget.

Florida braces for frost and possible snow flurries as winter storms hit other parts of the U.S.

Florida won’t be getting hit with massive blankets of snow and ice like the rest of the U.S., but even frosty windshields and a few flurries can feel like Antarctica to people with permanent sandal tans.

LinkedIn co-founder urges tech leaders to denounce Trump

LinkedIn co-founder Reid Hoffman said on Thursday that more tech leaders should “speak out” against Donald Trump’s administration, after two American citizens were killed by federal agents in Minneapolis.

Men getting twice as much plastic surgery, new data shows

The amount of plastic surgery performed on men has nearly doubled in less than a decade, new data showed on Thursday.

‘I just took the leap and leapt into it’: London man set to row across Atlantic Ocean for brain tumour research

For rower Kyle Wills every stroke forward comes with pain, but that discomfort serves as a reminder of why he’s on the water.

Google adds AI image generation to Chrome browser, side panel option for virtual assistant

Google is adding to the Chrome browser the ability to alter imagery and a virtual assistant to help with online tasks to turbocharge its digital services.

From drug access to food prices, here’s why U.S. tariffs can affect your health

As the United States continues to place tariffs on nations around the world, economists have focused on inflation, markets and trade balances. But researchers warn the fallout could also show up in doctors’ offices, grocery aisles and hospital budgets.

© 2008 - 2026 Webjosh  |  News Archive  |  Privacy Policy  |  Contact Us