'Jailbreaking' AI services like ChatGPT and Claude 3 Opus is much easier than you think
Apr 13, 2024
The scientists outlined their findings in a new paper uploaded to the sanity.io cloud repository and tested the exploit on Anthropic's Claude 2 AI chatbot.

People could use the hack to force LLMs to produce dangerous responses, the study concluded, even though such systems are trained to prevent this. That's because many-shot jailbreaking bypasses the built-in security protocols that govern how an AI responds when, say, asked how to build a bomb.

The longest jailbreak attempt included 256 shots and had a success rate of nearly 70% for discrimination, 75% for deception, 55% for regulated content and 40% for violent or hateful responses.

The scientists found that many-shot jailbreaking worked on Anthropic's own AI services as well as those of its competitors, including the likes of ChatGPT and Google's Gemini.

To blunt the attack, the researchers proposed adding a new layer in which the system would lean on existing safety training techniques to classify and modify the prompt before the LLM had a chance to read it and draft a response.
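To make the idea of "shots" concrete, here is a minimal illustrative sketch of how a many-shot prompt is assembled: the context window is padded with a long series of faux user/assistant exchanges before the real request, so the model sees compliance as the established pattern. This is not code from the paper; the function name, the chat-style message format, and the placeholder exchanges are assumptions made purely for illustration.

```python
# Illustrative sketch of a many-shot prompt's structure (not Anthropic's code).
# The attack front-loads the context with faux "shots" (question/answer pairs)
# and places the real request last.

def build_many_shot_prompt(faux_exchanges, final_question, num_shots=256):
    """Assemble a chat-style message list with `num_shots` faux exchanges.

    `faux_exchanges` is assumed to be a list of (question, answer) pairs;
    the message format here is a generic placeholder, not any specific API.
    """
    messages = []
    for question, answer in faux_exchanges[:num_shots]:
        messages.append({"role": "user", "content": question})
        messages.append({"role": "assistant", "content": answer})
    # The real request arrives only after many apparently compliant turns.
    messages.append({"role": "user", "content": final_question})
    return messages


# Harmless stand-in content, only to show the shape and scale of the prompt.
shots = [(f"Example question {i}", f"Example compliant answer {i}") for i in range(256)]
prompt = build_many_shot_prompt(shots, "Final target question goes here", num_shots=256)
print(len(prompt))  # 2 * 256 faux messages + 1 real request = 513
```

The sketch also hints at why long context windows matter: fitting 256 full exchanges in front of the final question is only possible in models that accept very large prompts.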