RESEARCH AREA
Massive neural networks trained on text at internet scale.
BERT: Pre-training of Deep Bidirectional Transformers
Language Models are Few-Shot Learners (GPT-3)
GPT-4 Technical Report
Scaling Laws for Neural Language Models
Andrej Karpathy
Independent
Ilya Sutskever
Safe Superintelligence Inc.
Alec Radford
OpenAI