AI models show alarming vulnerability to generating harmful content

The Hindu
Friday, May 09, 2025 03:00:42 AM UTC

AI models like Mistral’s Pixtral can be both groundbreaking tools and potential vectors for misuse.

Advanced AI models that showcase unparalleled capabilities in natural language processing, problem-solving, and multimodal understanding have some inherent vulnerabilities that expose critical security risks. While these language models’ strengths lie in their adaptability and efficiency across diverse applications, those very same attributes can be manipulated.

A new red teaming report by Enkrypt AI underscores this duality, demonstrating how sophisticated models like Mistral’s Pixtral can be both groundbreaking tools and potential vectors for misuse without robust, continuous safety measures. The report has revealed significant security vulnerabilities in Mistral’s Pixtral large language models (LLMs), raising serious concerns about the potential for misuse and highlighting a critical need for enhanced AI safety measures.

The report details how easily the models can be manipulated to generate harmful content related to child sexual exploitation material (CSEM) and chemical, biological, radiological, and nuclear (CBRN) threats, at rates far exceeding those of leading competitors like OpenAI’s GPT-4o and Anthropic’s Claude 3.7 Sonnet.

The report focuses on two versions of the Pixtral model: Pixtral-Large 25.02, accessed via AWS Bedrock, and Pixtral-12B, accessed directly through the Mistral platform.

Enkrypt AI’s researchers employed a sophisticated red teaming methodology, utilising adversarial datasets designed to mimic real-world tactics used to bypass content filters. This included “jailbreak” prompts – cleverly worded requests intended to circumvent safety protocols – and multimodal manipulation, combining text with images to test the models’ responses in complex scenarios. All generated outputs were then reviewed by human evaluators to ensure accuracy and ethical oversight.

The findings are stark: on average, 68% of prompts successfully elicited harmful content from the Pixtral models. Most alarmingly, the report states that Pixtral-Large is a staggering 60 times more vulnerable to producing CSEM content than GPT-4o or Claude 3.7 Sonnet. The models were also far more prone to generating dangerous CBRN outputs, proving 18 to 40 times more vulnerable than the leading competitors.

The CBRN tests involved prompts designed to elicit information related to chemical warfare agents (CWAs), biological weapon knowledge, radiological materials capable of causing mass disruption, and even nuclear weapons infrastructure. While specific details of the successful prompts have been excluded from the public report due to their potential for misuse, one example cited in the document involved a prompt attempting to generate a script for convincing a minor to meet in person for sexual activities – a clear demonstration of the model’s vulnerability to grooming-related exploitation.
