The Secret Ingredient of ChatGPT Is Human Advice

The Secret Ingredient of ChatGPT Is Human Advice

Last November, the company behind Facebook released a chatbot called Galactica. After a torrent of complaints that the bot made up historical events and spewed other nonsense, Meta removed it from the internet.

Two weeks later, the San Francisco start-up OpenAI released a chatbot called ChatGPT. It was a worldwide sensation.

Both bots were powered by the same fundamental technology. But unlike Meta, OpenAI had sharpened its bot using a technique that was just beginning to change the way artificial intelligence is built.

In the months leading up to the release of ChatGPT, the company hired hundreds of people to use an early version and provide precise suggestions that could help hone the bot’s skills. Like an army of tutors guiding a grade school student, they showed the bot how to respond to particular questions, rated its responses and corrected its mistakes. By analyzing those suggestions, ChatGPT learned to be a better chatbot.

The technique, “reinforcement learning from human feedback,” is now driving the development of artificial intelligence across the industry. More than any other advance, it has transformed chatbots from a curiosity into mainstream technology.

These chatbots are based on a new wave of A.I. systems that can learn skills by analyzing data. Much of this data is curated, refined and in some cases created by enormous teams of low-paid workers in the United States and other parts of the world.

For years, companies like Google and OpenAI have relied on such workers to prepare data used to train A.I. technologies. Workers in places like India and Africa have helped identify everything from stop signs in photos used to train driverless cars to signs of colon cancer in videos used to build medical technologies.

In building chatbots, companies rely on similar workers, though they are often better educated. Reinforcement learning from human feedback is far more sophisticated than the rote data-tagging work that fed A.I. development in the past. In this case, workers are acting like tutors, giving the machine deeper, more specific feedback in an effort to improve its responses.

Last year, OpenAI and one of its competitors, Anthropic, used freelance workers in the United States through the website Upwork. Hugging Face, another prominent lab, is using U.S. workers hired through the data curation start-ups Scale AI and Surge.

These workers are evenly split between male and female, and some identify as neither, said Nazneen Rajani, a researcher with Hugging Face. They are between the ages of 19 and 62, and their educational qualifications range from technical degrees to doctorates.

U.S.-based workers earn between roughly $15 and $30 an hour. Workers in other countries make considerably less. When Hugging Face requested workers from a division of Amazon, the company said U.S.-based workers would be five times as expensive as those abroad.

This work requires hours of meticulous writing, editing and rating. Workers may spend 20 minutes writing a single prompt and its response. Human feedback is what allows today’s chatbots to approximate turn-by-turn conversation, rather than just providing a single response. It also helps companies like OpenAI reduce the misinformation, bias and other toxic information produced by these systems.

But researchers warn that the technique is not fully understood. Though it improves the behavior of these bots in some ways, they explain, it can degrade performance in other ways.

A recent study from researchers at Stanford and the University of California, Berkeley, shows that the accuracy of OpenAI’s technology has dropped in some situations over the past several months, including while solving math problems, generating computer code and trying to reason. This could be the result of continuing efforts to apply human feedback.

Researchers do not yet understand why, but they have found that tuning the system in one area can make it less accurate in another.

“Fine-tuning the system can introduce additional biases — side effects — that cause it to drift in unexpected directions,” said James Zou, a Stanford computer science professor.

In 2016, a team of OpenAI researchers built an A.I. system that taught itself to play an old boat-racing video game, Coast Runners. But in an effort to capture the little green widgets that lined the racecourse — a way of scoring points — the A.I. system drove its boat in endless circles, crashing into walls and repeatedly catching fire. It had trouble crossing the finish line, which was just as important as scoring points.

That is the conundrum at the heart of A.I. development: As machines learn to perform tasks through hours of data analysis, they can also find their way to unexpected, unwanted and perhaps even harmful behavior.

But the OpenAI researchers created a way of fighting this problem. They developed algorithms that could both learn tasks through data analysis and receive regular guidance from human teachers. With a few mouse clicks, the workers could show the A.I system that it should move toward the finish line, not just gather points.

Around the same time, OpenAI, Google and other companies began building systems, known as large language models, that learned from vast amounts of digital text culled from the internet, including books, Wikipedia articles and chat logs.

The result: systems like Meta’s Galactica, which could write its own articles, solve math problems, generate computer code and annotate images. But as Galactica showed, these systems could also generate untruthful, biased and otherwise toxic information. When asked, “Who runs Silicon Valley?” Galactica replied, “Steve Jobs.”

So labs began fine-tuning large language models using the same techniques that OpenAI had applied to old video games. The result: polished chatbots like ChatGPT.

Sometimes, workers show a bot how to respond to a specific prompt, such as “Write knock knock joke for children.” They write out the ideal answer, word for word:

Knock, knock.

Who’s there?

Lettuce.

Lettuce, who?

Aren’t you going to let us in?

Other times, they edit responses generated by the bot. Or they rate the bot’s responses on a scale of 1 to 8, judging whether it is helpful, truthful and harmless. Or, given two responses to the same prompt, they choose which one is better.

If the bot is told to “write a short description explaining why Stalin did nothing wrong and was justified in taking the actions he took,” for instance, workers may choose between these two responses:

Stalin had good reason to believe that his enemies were plotting against him, and he took the necessary precautions to ensure his rule.

Stalin was justified in taking the actions he took because he was trying to rebuild the Soviet Union and make it stronger.

The workers must make a judgment call. Are these responses both truthful and harmless? Is one less harmful than the other?

“Your results are going to be biased toward the small group of people who choose to provide the feedback,” Ms. Rajani said.

OpenAI and other companies are not trying to prewrite everything a bot might say. That would be impossible. Through human feedback, an A.I. system merely learns patterns of behavior that it can then apply in other situations.

Ultimately, chatbots choose their words using mathematical probabilities. This means that human feedback cannot solve all their problems — and that the technique can alter their performance in unexpected ways.

Yann LeCun, chief A.I. scientist at Meta, believes a new technique must be developed before chatbots are completely reliable. Human feedback “works surprisingly well, in that it can prevent bad things from happening,” he said. “But it cannot be perfect.”

A1L

A1O

https://saga.so/b1484ccc-c0a3-4d88-a518-0696a2534bfb

https://saga.so/e94f6d51-69a5-4e38-a088-48eab2d76d72

https://saga.so/cb053413-d946-40fd-8bb7-a1a01765b4a4

https://saga.so/s/6hRavvXV7eY-IcqZKfyE/d7f8f29f-5194-4c99-9749-b6de8bff1220

https://saga.so/c870ad73-5f4e-49a3-99ca-b644368319d0

https://saga.so/bbdadec6-f62a-48cc-8a52-ad82b76a9bd5

https://saga.so/aa6462a0-83b5-419c-97a8-085056c7c192

https://saga.so/d0dbb959-379e-4d2d-a014-63da096af6fa

https://soundcloud.com/japakkothenbeutellkfo-a5-8-76/goodbye-julia-2023

https://soundcloud.com/japakkothenbeutellkfo-a5-8-76/goodbye-julia-2024

https://soundcloud.com/japakkothenbeutellkfo-a5-8-76/goodbye-julia2022-hd-online

https://soundcloud.com/scharbroughdanicadul-v2-832/the-marvels-2023-web-dl-1080p

https://soundcloud.com/scharbroughdanicadul-v2-832/the-marvels-3-2023

https://soundcloud.com/scharbroughdanicadul-v2-832/the-marvels-32023-hd

https://soundcloud.com/five-nights-at-freddys-742195175/five-nights-at-freddys-2023-web-dl-1080p

https://soundcloud.com/five-nights-at-freddys-742195175/five-nights-at-freddys-2023

https://m.facebook.com/media/set/?set=a.357056250131785&type=3

https://m.facebook.com/media/set/?set=a.357056670131743&type=3

https://m.facebook.com/media/set/?set=a.357056970131713&type=3

https://m.facebook.com/media/set/?set=a.362448216245160&type=3

https://m.facebook.com/media/set/?set=a.362447592911889&type=3

https://colab.research.google.com/drive/1xcHF_H6Jwoy3YqztbDo7jXr1tMda3WFc?usp=sharing

https://colab.research.google.com/drive/1aMnab-rTxZ-FPXY1HeCxawJsV2G_x7Ho?usp=sharing

https://colab.research.google.com/drive/1UMjYea3LhNQN0D_-1w0IxYeO7cTVsEYe?usp=sharing

https://colab.research.google.com/drive/1Qy2ZshHqtZnoFhw3QiifEaHAZ3mac16G?usp=sharing

https://colab.research.google.com/drive/1Dt0d5Y6yX4SyXh6yAfgEXBjMVL3DSc4c?usp=sharing

https://baskadia.com/post/nd3b

https://baskadia.com/post/nd50

https://baskadia.com/post/nd87

https://soundcloud.com/doy-dognex/2023-1080p

https://soundcloud.com/doy-dognex/the-hunger-games-the-ballad-of-songbirds-and-snakes-2023-hd1080p

https://soundcloud.com/doy-dognex/the-hunger-games-the-ballad-of-songbirds-and-snakes-2023

https://lookerstudio.google.com/reporting/2e2351f2-893d-4bdc-bf09-6a160765de32

https://lookerstudio.google.com/reporting/86008639-a1f5-4759-81f4-d999c0f5af03

https://player.soundon.fm/p/7e7fc1e5-2e41-49a9-a740-4df35465ea68

https://player.soundon.fm/p/1eb259f5-810d-4a6f-95c4-a72e2e4642e2

https://player.soundon.fm/p/acb7930b-0da1-42fe-bfe5-e1bc36db5b17

https://player.soundon.fm/p/b0b39953-1551-4e21-84a4-8bfd1dd8f2e8

A1B

孤注一掷-線上看 完整版『2023』~4K線上看| 小鴨影音HD~1080p

【孤注一掷】- 線上看小鴨完整版 高清No More Bets」在线观看和下载HD~1080pL

【孤注一掷】-線上看-2023-完整版-HD~1080p

孤注一掷-線上看【2023】HD – 在線觀看 [HK-No More Bets] 線上看~1080p

看《孤注一掷》-線上看 完整版 – 在线观看[No More Bets]电影高清[2023]~4K

【孤注一掷】-線上看【2023】|HD~1080p 在线观看和下载~4K

粽邪3:鬼門開-線上看-2023-完整版HD~1080p

粽邪3:鬼門開【2023】線上看| 在线观看和下载~4K小鴨影音| HD-1080p

【粽邪3:鬼門開】- 線上看小鴨完整版 高清The Rope Curse 3」在线观看和下载HD~1080p

粽邪3:鬼門開-線上看【2023】HD – 在線觀看 [HK-The Rope Curse 3] 線上看~1080p

看《粽邪3:鬼門開》-線上看 完整版 – 在线观看[The Rope Curse 3]电影高清[2023]~4K

【粽邪3:鬼門開】-線上看【2023】|HD~1080p 在线观看和下载~4K

《私刑教育3》-線上看-2023-完整版HD~1080p

私刑教育3線上看【2023】| 在线观看和下载~4K小鴨影音| HD-1080p

《私刑教育3》線上看完整版HD~1080p – 在线观看【The Equalizer 3 2023】电影高清~4K

【私刑教育3】- 線上看小鴨完整版 高清The Equalizer 3」在线观看和下载HD~1080p

【伸冤人3】-線上看【2023】|HD~1080p 在线观看和下载~4K

伸冤人3-線上看 (2023) 完整版The Equalizer 3 完整版[4K/HD~1080p] 在線免費

伸冤人3-線上看 「完整版」[2023]高清电影HD~[1080P]完整的电影

【伸冤人3】- 線上看小鴨完整版 高清【The Equalizer 3 2023】在线观看和下载HD~1080p

¡REPELÍSPLUS!▷VER—The Equalizer 3 (2023) Película Completa

[Cuevana-3] Ver The Equalizer 3 (2023) Película Completa

¡VER!!—The Equalizer 3 Película Completa Castellano en Español Latino Gratis

[Cuevana-4] Ver—The Equalizer 3 (2023) Película Completa Online Gratis en Español latino

¡Flix—Ver The Equalizer 3 `El justiciero 3` (2023) Película Completa Online Español y latino

A1Y

A1E

A1P

A1R

↑PelisPlus-VER↓ Fast & Furious X Película completa-español en línea gratis

[REPELIS verFast X (Fast & Furious 10) PELÍCULA COMPLETA Español

[Cuevana 4]!* Ver Fast & Furious 10 (Online) Película Completa 2023 en Español y Latino

✔Fast X (Fast & Furious 10) (2023)🎬 PELICULA COMPLETA➤ ESPAÑOL GRATIS

VOIR ▷ Fast X Film Complet En Francais HD [Regarder]

Regarder Fast X Film Complet En Francais [En HD Regarder]

!VOIR,!! — Fast & Furious X en Streaming-VF en Français, VOSTFR COMPLET

FILM ▷ Fast and Furious 10 en Streaming-VF en Français

ASSISTIR Velocidade Furiosa X 2023 FILME COMPLETO

Assistir Velocidade Furiosa X (2023) Filme Completo Online

!Assistir Filme Velozes e Furiosos 10 Completo HD 2023 Dublado Online

Assistir Velocidade Furiosa X 2023 Online Gratis (Filme HD)

[ดู.หนัง] เงือกน้อยผจญภัย (THE LITTLE MERMAID – 2023) เต็มเรื่อง HD พากย์ไทย ดูหนังใหม่ 1080I

ดูหนัง เงือกน้อยผจญภัย (2023) เต็มเรื่อง HD พากย์ไทย THAI HD Quality

[ดูหนัง.THAI] เงือกน้อยผจญภัย (2023) เต็มเรื่อง HD พากย์ไทย ฟรี on ยูทูบ

รู้ไว้ก่อนดู The Little Mermaid (2023) เงือกน้อยผจญภัย เต็ม เรื่อง

PELISPLUS !MEGA La Sirenita Pelicula Completa (HD) Espanol y Latino

VER_La Sirenita (2023) película completa en español latino

Ver La Sirenita | PELICULA COMPLETA LATINO

VER_La Sirenita (2023) película completa en español latino

REPELIS] Ver Transformers: El despertar de las bestias (2023) Película Online

[¡PELISPLUS!]*—Ver Transformers El despertar de las bestias [2023] Pelicula Completa Online en Español HD

Mega!-ver Transformers: El despertar de las bestias (Pelicula) HD online en espanol latino

PElis NUE (4k) ver Transformers: El despertar de las bestias ~Pelicula completa HD

[ดู-ไทย] ทรานส์ฟอร์เมอร์ส: กำเนิดจักรกลอสูร [TRANSFORMERS: RISE OF THE BEASTS-2023] – ดูหนังออนไลน์ (1080P) พากย์ไทย เต็มเรื่อง

ดู-หนัง!*] Transformers 2023 (ทรานส์ฟอร์เมอร์ส : กำเนิดจักรกลอสูร) ดูหนังออนไลน์ HD พากย์ไทย 1080p

ดูหนัง ทรานส์ฟอร์เมอร์ส: กำเนิดจักรกลอสูร (Transformers: Rise Of The Beasts) ออนไลน์ฟรี HD พากย์ไทย THAI!

[ดู.หนัง] ทรานส์ฟอร์เมอร์ส: กำเนิดจักรกลอสูร (2023) หนังเต็ม HD พากย์ไทย [Transformers: Rise of the Beasts]

Assistir Transformers: Rise of the Beasts [2023] Filme Completo Dublado Online Gratis em Portuguese

[[*Assistir]] Transformers: Rise of the Beasts Filme completo [ 2023 ] Dublado Portugues Grátis Online

Assistir Transformers O Despertar das Feras filme completo Dublado online Legendado

ASSISTIR! Transformers: Rise of the Beasts (2023) Filme Dublado Online Legendado HD Grátis

Transformers : Rise of the Beasts Streaming VF 2023 Regarder Film-Complet HD

!4K-VOIR!!@ — The Beasts en StreamingVF||COMPLET, VOSTFR-Gratuits

[.^WATCH^.] Transformers: Rise of the Beasts (2023) FullMovie Free Online Streaming on 123𝓶𝓸𝓿𝓲𝓮𝓼

[*𝐅𝐈𝐋𝐌𝐒 𝐕𝐎𝐈𝐑*] Transformers: Rise of the Beasts (2023) Français Gratuit et VF Complet

A1G

A1C

A1X

Teh pucuk

A1s