Rajat Patel
Home
Blog
Research
Projects
CV
Tags
architecture
1
attention
2
bpe
1
deep-learning
3
GPT
1
language-models
1
machine-learning
3
mdp
1
MoE
1
nlp
1
q-learning
1
reinforcement-learning
1
tokenization
1
transformers
2