from https://github.com/underlines/awesome-marketing-datascience/blob/master/llm-tools.md
- chatgptui/desktop
- chatbox is a Windows, Mac & Linux native ChatGPT Client
- BingGPT Desktop application of new Bing's AI-powered chat
- cheetah Speech to text for remote coding interviews, giving you hints from GTP3/4
cpp / ggml:
- llama.cpp runs ggml models up to 4-bit quantized on mac, linux and windows natively. Supports the new ggmlv3 format and runs on CPU and GPU. Allows for mixed use of CPU/GPU using BLAS libraries like cuBLAS, CLBLas etc.
- Alpaca.cpp
- koboldcpp llama.cpp with a fancy UI, persistent stories, editing tools, memory etc. Supporting ggmlv3 and old ggml, CLBlast and llama, RWKV, GPT-NeoX, Pythia models
- Serge chat interface based on llama.cpp for running Alpaca models. Entirely self-hosted, no API keys needed
- llama MPS inference on Apple Silicon GPU using much lower power but is slightly slower than llama.cpp which uses CPU
- bloomz.cpp Inference of HuggingFace's BLOOM-like models in pure C/C++
- RWKV.cpp CPU only port of BlinkDL/RWKV-LM to ggerganov/ggml. Supports FP32, FP16 and quantized INT4.
- RWKV Cuda a torchless, c++ rwkv implementation with 8bit quantization written in cuda
- secondbrain Multi-platform desktop app to download and run LLMs locally in your computer
gpt4all:
- gpt4all terminal and gui version to run local gpt-j models, compiled binaries for win/osx/linux
- gpt4all.zig terminal version of GPT4All
- gpt4all-chat Cross platform desktop GUI for GPT4All models (gpt-j)
others:
- Lit-llama training, fine tuning and inference of llama
- Dalai LLaMA-based ChatGPT for single GPUs
- ChatLLaMA LLaMA-based ChatGPT for single GPUs
- mlc-llm, run any LLM on any hardware (iPhones, Android, Win, Linux, Mac, WebGPU, Metal. NVidia, AMD)
- faraday.dev Run open-source LLMs on your Win/Mac. Completely offline. Zero configuration.
- ChatALL concurrently sends prompts to multiple LLM-based AI bots both local and APIs and displays the results
- pyllama hacked version of LLaMA based on Meta's implementation, optimized for Single GPUs
- gmessage visually pleasing chatbot that uses a locally running LLM server and supports multiple themes, chat history search, text to speech, JSON file export, and OpenAI API compatible Python code
- selfhostedAI one-click deployment of RWKV, ChatGLM, llama.cpp models for substituting the openAI API to a locally hosted API
- TypingMind
- Chatwithme.chat
- enricoros/nextjs-chatgpt-app
- no8081/chatgpt-demo
- IPython-gpt use chatGPT directly inside jupyter notebooks
- Chatbot UI An open source ChatGPT UI
- Text Generation Webui An all purpose UI to run LLMs of all sorts with optimizations (running LLaMA-13b on 6GB VRAM, HN Thread)
- Text Generation Webui Ph0rk0z fork supporting all GPTQ versions and max context of 8192 instead of 4096 (because some models support longer context now)
- Alpaca-LoRa-Serve
- chat petals web app + HTTP and Websocket endpoints for BLOOM-176B inference with the Petals client
- Alpaca-Turbo Web UI to run alpaca model locally on Win/Mac/Linux
- FreedomGPT Web app that executes the FreedomGPT LLM locally
- HuggingChat open source chat interface for transformer based LLMs by Huggingface
- openplayground enables running LLM models on a laptop using a full UI, supporting various APIs and local HuggingFace cached models
- gpt4all Web UI user friendly all-in-one interface, runs gpt_j, gptq, ggml and other model types
- RWKV-Runner Easy installation and running of RWKV Models, providing a local OpenAI API, GUI and custom CUDA kernel acceleration. Supports 2gb up to 32gb VRAM
- BrainChulo Chat App with vector based Long-Term Memory supporting one-shot, few-shot and Tool capable agents
- datafilik/GPT-Voice-Assistant
- Abdallah-Ragab/VoiceGPT
- LlmKira/Openaibot
- BarkingGPT Audio2Audio by using Whisper+chatGPT+Bark
- gpt_chatbot Windows / elevenlabs TTS + pinecone long term memory
- gpt-voice-conversation-chatbot using GPT3.5/4 API, elevenlab voices, google tts, session long term memory
- JARVIS-ChatGPT conversational assistant that uses OpenAI Whisper, OpenAI ChatGPT, and IBM Watson to provide quasi-real-time tips and opinions.
- ALFRED LangChain Voice Assistant, powered by GPT-3.5-turbo, whisper, Bark, pyttsx3 and more
- bark TTS for oobabooga/text-generation-webui make your local LLM talk
- bark TTS for oobabooga/text-generation-webui another implementation
- iris-llm local voice chat agent
- sqlchat Use OpenAI GPT3/4 to chat with your database
- chat-with-github-repo which uses streamlit, gpt3.5-turbo and deep lake to answer questions about a git repo
- mpoon/gpt-repository-loader uses Git and GPT-4 to convert a repository into a text format for various tasks, such as code review or documentation generation.
- LlamaIndex provides a central interface to connect your LLM's with external data
- Llama-lab home of llama_agi and auto_llama using LlamaIndex
- PrivateGPT a standalone question-answering system using LangChain, GPT4All, LlamaCpp and embeddings models to enable offline querying of documents
- Spyglass tests an Alpaca integration for a self-hosted personal search app. Select the llama-rama feature branch. Discussion on reddit
- local_llama chatting with your PDFs offline. gpt_chatwithPDF alternative with the ultimate goal of using llama instead of chatGPT
- Sidekick Information retrieval for LLMs
- DB-GPT SQL generation, private domain Q&A, data processing, unified vector storage/indexing, and support for various plugins and LLMs
- localGPT a privateGPT inspired document question-answering solution using GPU instead of CPU acceleration and InstructorEmbeddings, which perform better according to leaderboards instead of LlamaEmbeddings
- LocalDocs plugin for GPT4All
- annoy_ltm extension to add long term memory to chatbots using a nearest neighbor vector DB for memory retrieval
- ChatDocs PrivateGPT + Web UI + GPU Support + ggml, transformers, webui
- PAutoBot document question-answering engine developed with LangChain, GPT4All, LlamaCpp, ChromaDB, PrivateGPT, CPU only
- Paper QA LLM Chain for answering questions from documents with citations, using OpenAI Embeddings or local llama.cpp, langchain and FAISS Vector DB
- BriefGPT document summarization and querying using OpenAI' and locally run LLM's using LlamaCpp or GPT4ALL, and embeddings stored as a FAISS index, built using Langchain.
- sider chrome side-bar for chatGPT and OpenAI API supporting custom prompts and text highlighting
- chathub-dev/chathub
- Glarity open-source chrome extension to write summaries for various websites including custom ones and YouTube videos. Extensible
- superpower-chatgpt chrome extension / firefox addon to add missing features like Folders, Search, and Community Prompts to ChatGPT
- chatGPTBox add useful LLM chat-boxes to github and other websites, supporting self-hosted model (RWKV, llama.cpp, ChatGLM)
- Auto GPT
- AgentGPT Deploy autonomous AI agents, using vectorDB memory, web browsing via LangChain, website interaction and more including a GUI
- microGPT Autonomous GPT-3.5/4 agent, can analyze stocks, create art, order pizza, and perform network security tests
- Auto GPT Plugins
- AutoGPT-Next-Web An AgentGPT fork as a Web GUI
- AutoGPT Web
- AutoGPT.js
- LoopGPT a re-implementation of AutoGPT as a proper python package, modular and extensible
- Camel-AutoGPT Communicaton between Agents like BabyAGI and AutoGPT
- BabyAGIChatGPT is a fork of BabyAGI to work with OpenAI's GPT, pinecone and google search
- GPT Assistant An autonomous agent that can access and control a chrome browser via Puppeteer
- gptchat a client which uses GPT-4, adding long term memory, can write its own plugins and can fulfill tasks
- Chrome-GPT AutoGPT agent employing Langchain and Selenium to interact with a Chrome browser session, enabling Google search, webpage description, element interaction, and form input
- autolang Another take on BabyAGI, focused on workflows that complete. Powered by langchain.
- ai-legion A framework for autonomous agents who can work together to accomplish tasks.
- generativeAgent_LLM Generative Agents with Guidance, Langchain, and local LLMs, implementation of the "Generative Agents: Interactive Simulacra of Human Behavior" paper, blogpost
- Auto Vicuna Butler Baby-AGI fork / AutoGPT alternative to run with local LLMs
- BabyAGI AI-Powered Task Management for OpenAI + Pinecone or Llama.cpp
- Agent-LLM Webapp to control an agent-based Auto-GPT alternative, supporting GPT4, Kobold, llama.cpp, FastChat, Bard, Oobabooga textgen
- auto-llama-cpp fork of Auto-GPT with added support for locally running llama models through llama.cpp
- AgentOoba autonomous AI agent extension for Oobabooga's web ui
- RecurrentGPT Interactive Generation of (Arbitrarily) Long Text. Uses LSTM, prompt-engineered recurrence, maintains short and long-term memories, and updates these using semantic search and paragraph generation.
- SuperAGI open-source framework that enables developers to build, manage, and run autonomous agents. Supports tools extensions, concurrent agents, GUI, console, vector DBs, multi modal, telemetry and long term memory
- Alpaca-Turbo | Web UI to run alpaca model locally on Win/Mac/Linux
- FreedomGPT | Web app that executes the FreedomGPT LLM locally
- huggingGPT / JARVIS Connects LLMs with huggingface specialized models
- Langchain-huggingGPT reimplementation of HuggingGPT using langchain
- OpenAGI AGI research platform, solves multi step tasks with RLTF and supports complex model chains
- ViperGPT implementation for visual inference and reasoning with openAPI
- TaskMatrix former visual-chatgpt connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting.
- PandaGPT combines ImageBind and Vicuna to understand and combine multimodal inputs from text, image, audio, depth, thermal, and IMU.
- AGiXT agents with memory, model agnostic, docker deployment, plugin extendable, chat feature, speech to text and text to speech, REST api and more
- FauxPilot open source Copilot alternative using Triton Inference Server
- Turbopilot open source LLM code completion engine and Copilot alternative
- Tabby Self hosted Github Copilot alternative
- starcoder.cpp
- GPTQ-for-SantaCoder 4bit quantization for SantaCoder
- supercharger Write Software + unit tests for you, based on Baize-30B 8bit, using model parallelism
- Autodoc toolkit that auto-generates codebase documentation using GPT-4 or Alpaca, and can be installed in a git repository in about 5 minutes.
- smol-ai developer a personal junior developer that scaffolds an entire codebase with a human-centric and coherent whole program synthesis approach using <200 lines of Python and Prompts.
- locai kobold/oobabooga -compatible api for vscode
- oasis local LLaMA models in VSCode
- acheong08/ChatGPT Python reverse engineerded chatGPT API
- gpt4free Use reverse engineered GPT3.5/4 APIs of other website's APIs
- GPTCache, serve cached results based on embeddings in a vector DB, before querying the OpenAI API.
- kitt TTS + GPT4 + STT to create a conference call audio bot
- Marvin simplifies AI integration in software development with easy creation of AI functions and bots managed through a conversational interface
- chatgpt.js client-side JavaScript library for ChatGPT
- ChatGPT-Bridge use chatGPT plus' GPT-4 as a local API
- Powerpointer connects to openAPI GPT3.5 and creates a powerpoint out of your content
- EdgeGPT Reverse engineered API of Microsoft's Bing Chat using Edge browser
- FastLLaMA Python wrapper for llama.cpp
- WebGPT Inference in pure javascript
- TokenHawk performs hand-written LLaMA inference using WebGPU, utilizing th.cpp, th-llama.cpp, and th-llama-loader.cpp, with minimal dependencies
- WasmGPT ChatGPT-like chatbot in browser using ggml and emscripten
- AutoGPTQ easy-to-use model GPTQ quantization package with user-friendly CLI
- gpt-llama.cpp Replace OpenAi's GPT APIs with llama.cpp's supported models locally
- llama-node JS client library for llama (or llama based) LLMs built on top of llama-rs and llama.cpp.
- TALIS serves a LLaMA-65b API, optimized for speed utilizing dual RTX 3090/4090 GPUs on Linux
- Powerpointer-For-Local-LLMs connects to oobabooga's API and creates a powerpoint out of your content
- OpenChatKit open-source project that provides a base to create both specialized and general purpose chatbots and extensible retrieval system, using GPT-NeoXT-Chat-Base-20B as a base model
- webgpu-torch Tensor computation with WebGPU acceleration
- llama-api-server that uses llama.cpp and emulates an openAI API
- CTransformers python bindings for transformer models in C/C++ using GGML library, supporting GPT-2/J/NeoX, StableLM, LLaMA, MPT, Dollyv2, StarCoder
- basaran GUI and API as a drop-in replacement of the OpenAI text completion API. Broad HF eco system support (not only llama)
- CodeTF one-stop Python transformer-based library for code LLMs and code intelligence, training and inferencing on code summarization, translation, code generation
- LangChain | framework for developing LLM applications (example, paolorechia/learn-langchain with vicuna and GPTQ 4 bit support)
- LangFlow GUI for Langchain
- Toolformer implementation Allows LLMs to use Tools
- megabots to create LLM bots by providing Q&A, document retrieval, vector DBs, FastAPI, Gradio UI, GPTCache, guardrails, whisper, supports OpenAI API (local LLMs planned)
- Jsonformer: Generate Structured JSON from Language Models by handling JSON synthax, and letting LLM just output the values
- gorilla Enables LLMs to use tools by semantically and syntactically correctly invoking APIs. Reduces hallucination, custom trained model weights based on llama-7b
- simple llama finetuner
- LLaMA-LoRA Tuner
- alpaca-lora
- StackLLaMA Fine-Tuning Guide by huggingface
- xTuring LLM finetuning pipeline supporting LoRa & 4bit
- Microsoft DeepSpeed Chat
- How to train your LLMs
- H2O LLM Studio | Framework and no-code GUI for fine tuning SOTA LLMs
- Implementation of LLaMA-Adapter, to fine tune instructions within hours
- Hivemind Training at home
- Axolotl a llama, pythia, cerebras training environment optimized for Runpod supporting qlora, 4bit, flash attention, xformers
- LMFlow toolbox for finetuning, designed to be user-friendly, speedy, and reliable
- qlora uses bitsandbytes quantization and PEFT and transformers for efficient finetuning of quantized LLMs
- GPTQlora Efficient Finetuning of Quantized LLMs with GPTQ QLoRA and AutoGPTQ for quantization
- Vicuna FastChat
- SynapseML (previously known as MMLSpark),an open-source library that simplifies the creation of massively scalable machine learning (ML) pipelines
- Microsoft guidance efficient Framework for Enhancing Control and Structure in Modern Language Model Interactions. Demo project by paolorechia for local text-generation-webui. reddit thread. guidance fork and llama-cpp-python fork how-to on reddit
- Microsoft semantic-kernel a lightweight SDK enabling integration of AI Large Language Models (LLMs) with conventional programming languages
- Colossal-AI unified deep learning system that provides a collection of parallel components for distributed deep learning models. Provides data parallelism, pipeline parallelism, and tensor parallelism
- Alpaca-lora instruction finetuned using Low Rank Adaption
- codealpaca Instruction training data set for code generation
- LAION AI / Open-Assistant Dataset (https://github.com/LAION-AI/Open-Assistant / https://projects.laion.ai/Open-Assistant/ / https://open-assistant.io)
- ShareGPT pre-cleaned, English only "unfiltered," and 2048 token split version of the ShareGPT dataset ready for finetuning
- Vicuna ShareGPT pre-cleaned 90k conversation dataset
- Vicuna ShareGPT unfiltered
- GPTeacher
- alpaca-cleaned
- codealpaca 20k
- gpt3all pruned
- gpt4all_prompt_generations_with_p3
- gpt4all_prompt_generations
- alpaca-plus-gpt4all-without-p3
- Alpaca dataset from Stanford, cleaned and curated
- Alpaca Chain of Thought fine tuning dataset for EN and CN
- PRESTO paper Multilingual dataset for parsing realistic task-oriented dialogues by Google & University of Rochester, California, Santa Barbara, Columbia
- RedPajama Dataset and model similar to LLaMA but truly open source and ready for commercial use. hf
- BigCode The Stack
- open-instruct-v1
- awesome-instruction-dataset list of instruction datasets by yadongC
- The Embedding Archives Millions of Wikipedia Article Embeddings in multiple languages
- Rereplit-finetuned-v1-3b & replit-code-v1-3b outperforming all coding OSS models, gets released soon
- alpaca_evol_instruct_70k an instruction-following dataset created using Evol-Instruct, used to fine-tune WizardLM
- gpt4tools_71k.json from GPT4Tools paper, having 71k instruction-following examples for sound/visual/text instructions
- WizardVicuna 70k dataset used to fine tune WizardVicuna
- Numbers every LLM Developer should know
- airoboros uncensored
- CoT collection, paper
- airoboros-gpt4 fine-tuning dataset optimized for trivia, math, coding, closed context question answering, multiple choice, writing
- fin-llama a LLaMA finetuned for finance, code, model
- LLM Model Cards
- GPTs are GPTs: An early look at the labor market impact potential of LLMs
- ViperGPT Visual Inference via Python Execution for reasoning
- Emergent Abilities of LLMs , blog post
- facts checker reinforcement
- LLaVA: Large Language and Vision Assistant, combining LLaMA with a visual model. Delta-weights released
- Mass Editing Memory in a Transformer
- MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
- WizardLM | Fine tuned LLaMA 7B with evolving instructions, outperforming chatGPT and Vicuna 13B on complex test instructions (code, delta weights)
- Scaling Transformer to 1M tokens and beyond with RMT
- AudioGPT | Understanding and Generating Speech, Music, Sound, and Talking Head (github, hf space)
- Chameleon-llm, a paper about Plug-and-Play Compositional Reasoning with GPT-4
- GPT-4-LLM share data generated by GPT-4 for building an instruction-following LLMs with supervised learning and reinforcement learning. paper
- GPT4Tools Teaching LLM to Use Tools via Self-instruct. code
- CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society. preprint paper, website
- Poisoning Language Models During Instruction Tuning
- SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
- LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
- Dromedary: Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision, code, weights
- Unlimiformer: transformer-based model that can process unlimited length input by offloading attention computation to a k-nearest-neighbor index, extending the capabilities of existing models like BART and Longformer without additional weights or code modifications. code
- Salesforce LAVIS provides a comprehensive Python library for language-vision intelligence research, including state-of-the-art models like BLIP-2 for vision-language pretraining and Img2LLM-VQA for visual question answering, alongside a unified interface
- FLARE an active retrieval augmented generation technique that iteratively predicts, retrieves, and refines content, improving the accuracy and efficiency of long-form text generation in language models
- Hyena a subquadratic-time layer that has the potential to significantly increase context length in sequence models, using a combination of long convolutions and gating. Long Convs and Hyena implementations
- FastServe an efficient distributed inference serving system for LLMs that minimizes job completion time using preemptive scheduling and efficient GPU memory management, built on NVIDIA FasterTransformer.
- FrugalGPT is a model that uses LLM cascade to optimize the performance and cost-efficiency of LLMs like GPT-4.
- Landmark Attention LLaMa 7B with 32k tokens. Code, llama7b diff weights, merged llama7b weights
- QLORA Efficient Finetuning of Quantized LLMs
- Tree of Thoughts (ToT) Enables exploration over text, improves strategic decision-making in language models. Code. Example implementation, discussion
- MEGABYTE Efficient multiscale decoder architecture for long-sequence modeling.
- PandaGPT: project page, code, model combines ImageBind and Vicuna to understand and combine multimodal inputs from text, image, audio, depth, thermal, and IMU.
- LIMA Less Is More for Alignment. Shows fine-tuning with 1000 carefully curated prompts without reinforcement learning can outperforms GPT-4 in many cases
- Gorilla a finetuned LLaMA-based model that surpasses GPT-4 in writing API calls and reduces hallucination. project, code
- Voyager Open-Ended Embodied Minecraft Agent using LLMs, project, code
- BigTrans llama adapted to multilingual translation over 100 languages, outperforming chatGPT in 8 language-pairs
- BPT memory-efficient approach to processing long input sequences in Transformers
- Lion efficiently transfers knowledge from a closed-source LLM to an open-source student model
- Undetectable Watermarks for Language Models using one-way functions
- ALiBi Train Short Test Long. Attention with Linear Biases Enables Input Length Extrapolation. code
- The Curse of Recursion: Training on Generated Data Makes Models Forget
- Brainformers a complex block for natural language processing that outperforms state-of-the-art Transformers in efficiency and quality
- AWQ Activation aware Weight Quantization, code
- SpQR quantization by Tim Dettmers, code, twitter
- LLM Worksheet by randomfoo2
- The full story of LLMs
- Brief history of llama models
- A timeline of transformer models
- Every front-end GUI client for ChatGPT API
- LLMSurvey a collection of papers and resources including an LLM timeline
- rentry.org/lmg_models a list of llama derrivates and models
- Timeline of AI and language models and Model Comparison Sheet by Dr. Alan D. Thompson
- Brex's Prompt Engineering Guide an evolving manual providing historical context, strategies, guidelines, and safety recommendations for building programmatic systems on OpenAI's GPT-4
- LLMs Practical Guide actively curated collection of a timeline and guides for LLMs, providing a historical context and restrictions based on this paper and community contributions
- LLMSurvey based on this paper, builds a collection of further papers and resources related to LLMs including a timeline
- LLaMAindex can now use Document Summary Index for better QA performance compared to vectorDBs
- ossinsight.io chat-gpt-apps Updated list of top chatGPT related repositories
- Opinionate.io AI Debating AI
- phind.com Developer Search Engine
- Voice Q&A Assistant using ChatGPT API, Embeddings, Gradio, Eleven Labs and Whisper
- chatpdf, Q&A for PDFs
- https://github.com/bigscience-workshop/petals
- https://github.com/chavinlo/distributed-diffusion
- https://github.com/VoltaML/voltaML-fast-stable-diffusion
- https://github.com/FMInference/FlexGen
- https://github.com/alpa-projects/alpa
- https://github.com/kir-gadjello/zipslicer
- https://github.com/modular-ml/wrapyfi-examples_llama
- https://github.com/tloen/llama-int8
- 4 bits quantization of LLaMa using GPTQ (discussion)
- https://petals.ml/
- https://github.com/facebookincubator/AITemplate
- https://github.com/HazyResearch/flash-attention
- https://github.com/stochasticai/x-stable-diffusion
- ExLlama a more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. By ReturningTarzan
- tokenmonster ungreedy tokenizer increases inference speed and context-length by 35% for pre-training on new LLMs
- Open LLM Leaderboard by HuggingFaceH4
- LMSys Chatbot Arena Leaderboard, blogpost is an anonymous benchmark platform for LLMs that features randomized battles in a crowdsourced manner
- Current best choices on LocalLLaMA reddit
- LLM Logic Tests by YearZero on reddit/localllama
- paperswithcode has LLM SOTA leaderboards, but usually just for foundation models
- Can AI code a self-evaluating interview for AI coding models. code
- Gotzmann LLM Score v2 by Gatzuma on Reddit
- Aviary Explorer open source utility to compare leading OSS LLMs and see votes, pricing per token etc.
- Comparative look at (ggml) quantization and parameter size part 1 by KerfuffleV2
- Updated relative comparison of ggml quantization types and effect on perplexity part 2 by KerfuffleV2
- Programming performance ranking for popular LLaMAs using HumanEval+ by ProfessionalHand9945
- llm-humaneval-benchmarks HumanEval+
- CoT Hub
- C-Eval Benchmark
- programming eval by catid from reddit, code
- Big-bench a collaborative benchmark featuring over 200 tasks for evaluating the capabilities of llms
- Pythia interpretability analysis for autoregressive transformers during training
- AlpacaEval automatic evaluation for instruction following LLMs, validated against 20k human annotations, reddit announcement
- Pinecone proprietary vector search for semantic search, recommendations and information retrieval
- FAISS Library for Efficient Similarity Search and Clustering using vectors
- Weaviate open source vector DB for services like OpenAI, HF etc for text, image, Q&A etc.
- vespa.ai one of the only scalable vector DBs that supports multiple vectors per schema field
- LanceDB free open-source serverless vector DB with support for langchain, llamaindex and multi-modal data
- Deeplake Vector Database for audio, text, vectors, video
- milvus open-source cloud-native vector DB focusing on embedding vectors converted from unstructured data
- chroma open-source embedding database