Skip to content

Instantly share code, notes, and snippets.

@simonw
Created March 27, 2024 18:34
Show Gist options
  • Save simonw/5a881cff8b684ba090802c46e2f30b6e to your computer and use it in GitHub Desktop.
Save simonw/5a881cff8b684ba090802c46e2f30b6e to your computer and use it in GitHub Desktop.

2024-03-27T18:31:58 conversation: 01ht0jyen4mm33wxq2mz497tpf

Model: gemini-1.5-pro-latest

Prompt:

... LONG transcript, from a 97K text file Whisper transcript

Summarize this text, then put out a heading for each of the key themes and under each heading include some spicy illustrative quotes

Response:

Open Source AI: A Conversation with Simon Willison

This text is a transcript of a conversation between James, Adam, and Simon Willison, co-creator of Django and a prominent voice in the open-source AI discussion. They explore the potential and perils of open-source AI, particularly large language models (LLMs), and address concerns raised in an IEEE Spectrum op-ed titled "Open Source AI is Uniquely Dangerous."

Open Source LLMs: A New Frontier

The conversation begins with a historical overview of open-source LLMs, highlighting the rapid progress in the field. Simon emphasizes the significance of being able to run these models on personal devices, enabling experimentation and understanding:

"The moment it was out there, people started poking at it... and suddenly I could run it on my Mac... That was one of those moments where it felt like the future was opening up right in front of me."

He expresses excitement about the democratization of this technology and the potential for innovation:

"This technology is very clearly, extremely important to the future of all sorts of things that we want to do... It is far too important for a small group of companies to completely control this technology."

Debunking "Unsecured AI"

The discussion then tackles the claims made in the IEEE Spectrum op-ed, particularly the notion of open-source AI being "uniquely dangerous" and the proposed rebranding as "unsecured AI." Simon vehemently disagrees with this framing:

"I'm quite angry with the abuse of the term open source in the AI world... The term open source in sort of the wider idea of AI has come down to, no, it's a thing that you can run yourself. Obviously, that's not what the term means."

He argues that open-source models allow for better stress testing and research, ultimately leading to improved security:

"Opening these models allows us to stress test them different ways, allows researchers to play with them in different ways... and discover new capabilities."

Ethical Concerns and the Future of Training Data

The conversation also delves into the ethical concerns surrounding training data, particularly the use of copyrighted material. Simon acknowledges the challenges:

"The ethics of the training are furiously... troublesome, right? There are very real ethical concerns about how these things are built."

He expresses hope for the development of "vegan models" trained solely on public domain data, while recognizing the potential limitations:

"I want to play with a model that's trained entirely on public domain data... My question is, if you did train a model on public domain data, would it have a 1930s... trash?"

LLMs as Powerful Tools for Augmentation

Despite the challenges, Simon remains optimistic about the potential of LLMs to augment human capabilities across various disciplines:

"I love AI as the... electric bicycle for your mind... It's a tool that helps people take on more ambitious things."

He emphasizes the importance of understanding the limitations of these models and developing intuition for their effective use:

"I think these tools are incredibly difficult to use effectively... Getting the really top level results from it requires so much experience combined with knowledge combined with intuition."

The conversation concludes with a shared excitement for the future of open-source AI and the potential for user interface and design innovations to unlock its full potential.

@simonw
Copy link
Author

simonw commented Mar 27, 2024

cat /Users/simon/Dropbox/Presentations/2024/01JAN15_OxideAndFriends/Oxide\ and\ Friends\ 1⧸15⧸2024\ --\ Open\ Source\ LLMs\ with\ Simon\ Willison\ \[Wqkvn4YyGpA\].txt | llm -m gemini-1.5-pro-latest 'Summarize this text, then put out a heading for each of the key themes and under each heading include some spicy illustrative quotes'

@simonw
Copy link
Author

simonw commented Mar 27, 2024

James, Adam and Simon is wrong - misses Bryan. But the transcript had no speaker names so it would have had to pick them up from context.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment