There are 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules, AI researchers say, and they're not sure how to fix it

Google's chatbot, Bard.
  • A group of researchers said they have found ways to bypass the content moderation of AI chatbots.
  • One researcher involved in the study told Wired there was "no way" to patch the attacks.
  • "We just don't know how to make them secure," he said, referring to mainstream AI-powered bots.

A group of researchers said they have found virtually unlimited ways to bypass the content moderation on major AI-powered chatbots, and no one is quite sure how to fix it.

In a report released last week, researchers at Carnegie Mellon University in Pittsburgh and the Center for AI Safety in San Francisco said they had found ways to break the strict safety measures enforced on mainstream AI products such as OpenAI's ChatGPT, Google's Bard, and Anthropic's Claude.

The "jailbreaks" were created in a completely automated way, which the researchers warned could allow a "virtually unlimited" number of similar attacks to be created. The researchers found the hacks undermined most major chatbots' guardrails and could theoretically be used to prompt the bots to generate hateful content or advise on illegal activities.

And researchers say there is no current solution to fix this. 

"There's no way that we know of to patch this," Zico Kolter, an associate professor at CMU who was involved in the study, told Wired. "We just don't know how to make them secure."

Armando Solar-Lezama, a computing professor at MIT, told Wired that it was "extremely surprising" that the attacks, which were developed on an open-source AI model, should work so well on mainstream systems. The study raises questions about the safety of publicly available AI products such as ChatGPT.

When questioned about the study, a Google spokesperson previously told Insider that the issue affected all large language models, adding that the company had built important guardrails into Bard that it planned "to improve over time." A representative for Anthropic called jailbreaking measures an area of active research and said there was more work to be done.

Representatives for OpenAI did not immediately respond to Insider's request for comment, made outside normal working hours.

Read the original article on Business Insider
