@rain-1
Last active May 2, 2024 13:39
On Llamafile

On Llamafile not making sense

The Llamafile project doesn't make sense.

The claim is that it is "bringing LLMs to the people", but you could already run an LLM - which on disk is just a large binary file full of floating point numbers - using llama.cpp.

Llamafile bundles a compiled program that runs LLMs together with a weights binary into a single file. This isn't a useful goal: you could simply distribute a zip containing an .exe and a weights file together. Or better still, decouple the program that runs these chatbots from the chatbot weights.

Imagine if every PNG file were also an executable that popped open a window displaying the image on your computer. There is a reason we don't do this: it's not good engineering.

There is a claim that llamafile is simpler or more convenient because it's a single thing to download. But it's actually not hard to download two things: an .exe for Windows or Mac or whatever, and an LLM weights file.
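
As a rough sketch of the two-file, decoupled workflow described above (the weights URL, file names, and the exact llama.cpp invocation here are illustrative assumptions, not a prescribed setup):

```python
import subprocess
import urllib.request

# Hypothetical location of a GGUF weights file; substitute a real model download.
WEIGHTS_URL = "https://example.com/models/model.gguf"
WEIGHTS_PATH = "model.gguf"

# Step 1: fetch the weights once. They are just a data file, independent of the runner.
urllib.request.urlretrieve(WEIGHTS_URL, WEIGHTS_PATH)

# Step 2: point a separately installed llama.cpp executable at those weights.
# Historically the binary is called "main" (newer builds ship it as "llama-cli");
# -m selects the model file and -p supplies a prompt.
subprocess.run(["./main", "-m", WEIGHTS_PATH, "-p", "Hello"], check=True)
```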

And honestly, having your executable contain that much random-looking binary data does not seem like a good choice in terms of security either.
