DinoDS isn’t “more scraped data.” It’s behavior engineering for LLMs.
|
I don’t think the interesting question anymore is “how much data did you scrape?” It’s: That’s how we’ve been thinking about DinoDS. Not as one giant text pile, but as narrower training slices for things like:
The raw data matters, obviously. But the real value feels more and more like: That’s the shift I’m most interested in right now. Less scraping. Curious if others here are thinking about datasets the same way. Check it www.dinodsai.com :)) submitted by /u/JayPatel24_ |
Like
0
Liked
Liked