NA
@n4k3dturtl3
I drink and I hack things
ID: 867464784379166720
24-05-2017 19:38:16
1,1K Tweet
1,1K Followers
763 Following
Assignment 1 (get basic pipeline working): implement BPE tokenizer, Transformer architecture, Adam optimizer, train models on TinyStories and OpenWebText. Only PyTorch primitives are allowed (canโt just call torch.nn.Transformer or even torch.nn.Linear). github.com/stanford-cs336โฆ