[R] Tiny transformers (

[R] Tiny transformers (<100 params) can add two 10-digit numbers to 100% accuracy

Really interesting project. Crazy you can get such good performance. A key component is that they are digit tokens. Floating math will be way tricker.

submitted by /u/LetsTacoooo
[link] [comments]

Liked Liked