Ep 83 - Pablo Samuel Castro (Google) - Reinforcement Learning, Human Feedback, and ChatGPT

Hacia Afuera con Omar Espejel

Feb 17 2023 • 51 mins

Pablo Samuel Castro (@pcastr on Twitter) is a Staff Research Software Developer at Google, where he has worked for more than 11 years. His work focuses on Reinforcement Learning. He holds a PhD in Computer Science from McGill University. In this episode, Pablo explains how Reinforcement Learning (RL) and RL from Human Feedback (RLHF) work, the latter being key to the development of language models such as ChatGPT. Pablo also applies RL to creative activities such as music and talks with us about that work.
