• 24 Posts
  • 926 Comments
Joined 2 years ago
cake
Cake day: June 23rd, 2023

help-circle




  • There are several “good” LLMs trained on open datasets like FineWeb, LAION, DataComp, etc.

    Then use those as training data. You’re too caught up on this exacting definition of open source that you’ll completely ignore the benefits of what this model could provide.

    an LLM could decide to, for example, summarize and compress some context full of trade secrets, then proceed to “search” for it, sending it to wherever it has access to.

    That’s not how LLMs work, and you know it. A model of weights is not a lossless compression algorithm.

    Also, if you’re giving an LLM free reign to all of your session tokens and security passwords, that’s on you.