softmax1
Popular repositories
- Flash-Attention-Softmax-N (Public): CUDA and Triton implementations of Flash Attention with SoftmaxN.
- nanoGPT_softmax1 (Public, Python): An experiment comparing nanoGPT with nanoGPT (softmax1) to see how softmax1 affects the perplexity score.
- nanoGPT_softmax1_reddit (Public, Python): Forked from karpathy/nanoGPT. The simplest, fastest repository for training/finetuning medium-sized GPTs.
Repositories
Showing 7 of 7 repositories
- MosaicBERT-Softmax1 (Public)
- nanoGPT_softmax1 (Public): An experiment comparing nanoGPT with nanoGPT (softmax1) to see how softmax1 affects the perplexity score.
- nanoGPT_softmax1_reddit (Public): Forked from karpathy/nanoGPT. The simplest, fastest repository for training/finetuning medium-sized GPTs.