YaKaiLi/llm-tool.md

## llm-tool.md

      
    Raw
  

              llm-tool.md
            
          
    from https://github.com/underlines/awesome-marketing-datascience/blob/master/llm-tools.md
Tools

Native GUIs

openAI


chatgptui/desktop
chatbox is a Windows, Mac & Linux native ChatGPT Client
BingGPT Desktop application of new Bing's AI-powered chat
cheetah Speech to text for remote coding interviews, giving you hints from GTP3/4

Local LLMs

cpp / ggml:

llama.cpp runs ggml models up to 4-bit quantized on mac, linux and windows natively. Supports the new ggmlv3 format and runs on CPU and GPU. Allows for mixed use of CPU/GPU using BLAS libraries like cuBLAS, CLBLas etc.
Alpaca.cpp
koboldcpp llama.cpp with a fancy UI, persistent stories, editing tools, memory etc. Supporting ggmlv3 and old ggml, CLBlast and llama, RWKV, GPT-NeoX, Pythia models
Serge chat interface based on llama.cpp for running Alpaca models. Entirely self-hosted, no API keys needed
llama MPS inference on Apple Silicon GPU using much lower power but is slightly slower than llama.cpp which uses CPU
bloomz.cpp Inference of HuggingFace's BLOOM-like models in pure C/C++
RWKV.cpp CPU only port of BlinkDL/RWKV-LM to ggerganov/ggml. Supports FP32, FP16 and quantized INT4.
RWKV Cuda a torchless, c++ rwkv implementation with 8bit quantization written in cuda
secondbrain Multi-platform desktop app to download and run LLMs locally in your computer

gpt4all:

gpt4all terminal and gui version to run local gpt-j models, compiled binaries for win/osx/linux
gpt4all.zig terminal version of GPT4All
gpt4all-chat Cross platform desktop GUI for GPT4All  models (gpt-j)

others:

Lit-llama training, fine tuning and inference of llama
Dalai LLaMA-based ChatGPT for single GPUs
ChatLLaMA LLaMA-based ChatGPT for single GPUs
mlc-llm, run any LLM on any hardware (iPhones, Android, Win, Linux, Mac, WebGPU, Metal. NVidia, AMD)
faraday.dev Run open-source LLMs on your Win/Mac. Completely offline. Zero configuration.
ChatALL concurrently sends prompts to multiple LLM-based AI bots both local and APIs and displays the results
pyllama hacked version of LLaMA based on Meta's implementation, optimized for Single GPUs
gmessage visually pleasing chatbot that uses a locally running LLM server and supports multiple themes, chat history search, text to speech, JSON file export, and OpenAI API compatible Python code
selfhostedAI one-click deployment of RWKV, ChatGLM, llama.cpp models for substituting the openAI API to a locally hosted API

Web GUIs

openAI


TypingMind
Chatwithme.chat
enricoros/nextjs-chatgpt-app
no8081/chatgpt-demo
IPython-gpt use chatGPT directly inside jupyter notebooks
Chatbot UI An open source ChatGPT UI

Local LLMs


Text Generation Webui An all purpose UI to run LLMs of all sorts with optimizations (running LLaMA-13b on 6GB VRAM, HN Thread)
Text Generation Webui Ph0rk0z fork supporting all GPTQ versions and max context of 8192 instead of 4096 (because some models support longer context now)
Alpaca-LoRa-Serve
chat petals web app + HTTP and Websocket endpoints for BLOOM-176B inference with the Petals client
Alpaca-Turbo Web UI to run alpaca model locally on Win/Mac/Linux
FreedomGPT Web app that executes the FreedomGPT LLM locally
HuggingChat open source chat interface for transformer based LLMs by Huggingface
openplayground enables running LLM models on a laptop using a full UI, supporting various APIs and local HuggingFace cached models
gpt4all Web UI user friendly all-in-one interface, runs gpt_j, gptq, ggml and other model types
RWKV-Runner Easy installation and running of RWKV Models, providing a local OpenAI API, GUI and custom CUDA kernel acceleration. Supports 2gb up to 32gb VRAM
BrainChulo Chat App with vector based Long-Term Memory supporting one-shot, few-shot and Tool capable agents

Voice Assistants

openAI


datafilik/GPT-Voice-Assistant
Abdallah-Ragab/VoiceGPT
LlmKira/Openaibot
BarkingGPT Audio2Audio by using Whisper+chatGPT+Bark
gpt_chatbot Windows / elevenlabs TTS + pinecone long term memory
gpt-voice-conversation-chatbot using GPT3.5/4 API, elevenlab voices, google tts, session long term memory
JARVIS-ChatGPT conversational assistant that uses OpenAI Whisper, OpenAI ChatGPT, and IBM Watson to provide quasi-real-time tips and opinions.
ALFRED LangChain Voice Assistant, powered by GPT-3.5-turbo, whisper, Bark, pyttsx3 and more

Local LLMs


bark TTS for oobabooga/text-generation-webui make your local LLM talk
bark TTS for oobabooga/text-generation-webui another implementation
iris-llm local voice chat agent

Information retrieval

openAI


sqlchat Use OpenAI GPT3/4 to chat with your database
chat-with-github-repo which uses streamlit, gpt3.5-turbo and deep lake to answer questions about a git repo
mpoon/gpt-repository-loader uses Git and GPT-4 to convert a repository into a text format for various tasks, such as code review or documentation generation.

Local LLMs


LlamaIndex provides a central interface to connect your LLM's with external data
Llama-lab home of llama_agi and auto_llama using LlamaIndex
PrivateGPT a standalone question-answering system using LangChain, GPT4All, LlamaCpp and embeddings models to enable offline querying of documents
Spyglass tests an Alpaca integration for a self-hosted personal search app. Select the llama-rama feature branch. Discussion on reddit
local_llama chatting with your PDFs  offline. gpt_chatwithPDF alternative with the ultimate goal of using llama instead of chatGPT
Sidekick Information retrieval for LLMs
DB-GPT SQL generation, private domain Q&A, data processing, unified vector storage/indexing, and support for various plugins and LLMs
localGPT a privateGPT inspired document question-answering solution using GPU instead of CPU acceleration and InstructorEmbeddings, which perform better according to leaderboards instead of LlamaEmbeddings
LocalDocs plugin for GPT4All
annoy_ltm extension to add long term memory to chatbots using a nearest neighbor vector DB for memory retrieval
ChatDocs PrivateGPT + Web UI + GPU Support + ggml, transformers, webui
PAutoBot document question-answering engine developed with LangChain, GPT4All, LlamaCpp, ChromaDB, PrivateGPT, CPU only

Model Agnostic


Paper QA LLM Chain for answering questions from documents with citations, using OpenAI Embeddings or local llama.cpp, langchain and FAISS Vector DB
BriefGPT document summarization and querying using OpenAI' and locally run LLM's using LlamaCpp or GPT4ALL, and embeddings stored as a FAISS index, built using Langchain.

Browser Extensions

openAI


sider chrome side-bar for chatGPT and OpenAI API supporting custom prompts and text highlighting
chathub-dev/chathub
Glarity open-source chrome extension to write summaries for various websites including custom ones and YouTube videos. Extensible
superpower-chatgpt chrome  extension / firefox addon to add missing features like Folders, Search, and Community Prompts to ChatGPT

Local LLMs


chatGPTBox add useful LLM chat-boxes to github and other websites, supporting self-hosted model (RWKV, llama.cpp, ChatGLM)

Agents / Automatic GPT

openAI


Auto GPT
AgentGPT Deploy autonomous AI agents, using vectorDB memory, web browsing via LangChain, website interaction and more including a GUI
microGPT  Autonomous GPT-3.5/4 agent, can analyze stocks, create art, order pizza, and perform network security tests
Auto GPT Plugins
AutoGPT-Next-Web An AgentGPT fork as a Web GUI
AutoGPT Web
AutoGPT.js
LoopGPT a re-implementation of AutoGPT as a proper python package, modular and extensible
Camel-AutoGPT Communicaton between Agents like BabyAGI and AutoGPT
BabyAGIChatGPT is a fork of BabyAGI to work with OpenAI's GPT, pinecone and google search
GPT Assistant An autonomous agent that can access and control a chrome browser via Puppeteer
gptchat a client which uses GPT-4, adding long term memory, can write its own plugins and can fulfill tasks
Chrome-GPT  AutoGPT agent employing Langchain and Selenium to interact with a Chrome browser session, enabling Google search, webpage description, element interaction, and form input
autolang Another take on BabyAGI, focused on workflows that complete. Powered by langchain.
ai-legion A framework for autonomous agents who can work together to accomplish tasks.
generativeAgent_LLM Generative Agents with Guidance, Langchain, and local LLMs, implementation of the "Generative Agents: Interactive Simulacra of Human Behavior" paper, blogpost

Local LLMs


Auto Vicuna Butler Baby-AGI fork / AutoGPT alternative to run with local LLMs
BabyAGI AI-Powered Task Management for OpenAI + Pinecone or Llama.cpp
Agent-LLM Webapp to control an agent-based Auto-GPT alternative, supporting GPT4, Kobold, llama.cpp, FastChat, Bard, Oobabooga textgen
auto-llama-cpp fork of Auto-GPT with added support for locally running llama models through llama.cpp
AgentOoba autonomous AI agent extension for Oobabooga's web ui
RecurrentGPT Interactive Generation of (Arbitrarily) Long Text. Uses LSTM, prompt-engineered recurrence, maintains short and long-term memories, and updates these using semantic search and paragraph generation.
SuperAGI open-source framework that enables developers to build, manage, and run autonomous agents. Supports tools extensions, concurrent agents, GUI, console, vector DBs, multi modal, telemetry and long term memory

Multi Modal


Alpaca-Turbo | Web UI to run alpaca model locally on Win/Mac/Linux
FreedomGPT | Web app that executes the FreedomGPT LLM locally
huggingGPT / JARVIS Connects LLMs with huggingface specialized models
Langchain-huggingGPT reimplementation of HuggingGPT using langchain
OpenAGI AGI research platform, solves multi step tasks with RLTF and supports complex model chains
ViperGPT implementation for visual inference and reasoning with openAPI
TaskMatrix former visual-chatgpt connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting.
PandaGPT combines ImageBind and Vicuna to understand and combine multimodal inputs from text, image, audio, depth, thermal, and IMU.
AGiXT agents with memory, model agnostic, docker deployment, plugin extendable, chat feature, speech to text and text to speech, REST api and more

Code generation


FauxPilot open source Copilot alternative using Triton Inference Server
Turbopilot open source LLM code completion engine and Copilot alternative
Tabby Self hosted Github Copilot alternative
starcoder.cpp
GPTQ-for-SantaCoder 4bit quantization for SantaCoder
supercharger Write Software + unit tests for you, based on Baize-30B 8bit, using model parallelism
Autodoc toolkit that auto-generates codebase documentation using GPT-4 or Alpaca, and can be installed in a git repository in about 5 minutes.
smol-ai developer a personal junior developer that scaffolds an entire codebase with a human-centric and coherent whole program synthesis approach using <200 lines of Python and Prompts.
locai kobold/oobabooga -compatible api for vscode
oasis local LLaMA models in VSCode

Libraries and Wrappers

openAI


acheong08/ChatGPT Python reverse engineerded chatGPT API
gpt4free Use reverse engineered GPT3.5/4 APIs of other website's APIs
GPTCache, serve cached results based on embeddings in a vector DB, before querying the OpenAI API.
kitt TTS + GPT4 + STT to create a conference call audio bot
Marvin simplifies AI integration in software development with easy creation of AI functions and bots managed through a conversational interface
chatgpt.js client-side JavaScript library for ChatGPT
ChatGPT-Bridge use chatGPT plus' GPT-4 as a local API
Powerpointer connects to openAPI GPT3.5 and creates a powerpoint out of your content
EdgeGPT Reverse engineered API of Microsoft's Bing Chat using Edge browser

Local LLMs


FastLLaMA Python wrapper for llama.cpp
WebGPT Inference in pure javascript
TokenHawk performs hand-written LLaMA inference using WebGPU, utilizing th.cpp, th-llama.cpp, and th-llama-loader.cpp, with minimal dependencies
WasmGPT ChatGPT-like chatbot in browser using ggml and emscripten
AutoGPTQ easy-to-use model GPTQ quantization package with user-friendly CLI
gpt-llama.cpp Replace OpenAi's GPT APIs with llama.cpp's supported models locally
llama-node JS client library for llama (or llama based) LLMs built on top of llama-rs and llama.cpp.
TALIS serves a LLaMA-65b API, optimized for speed utilizing dual RTX 3090/4090 GPUs on Linux
Powerpointer-For-Local-LLMs connects to oobabooga's API and creates a powerpoint out of your content
OpenChatKit open-source project that provides a base to create both specialized and general purpose chatbots and extensible retrieval system, using GPT-NeoXT-Chat-Base-20B as a base model
webgpu-torch Tensor computation with WebGPU acceleration
llama-api-server that uses llama.cpp and emulates an openAI API
CTransformers python bindings for transformer models in C/C++ using GGML library, supporting GPT-2/J/NeoX, StableLM, LLaMA, MPT, Dollyv2, StarCoder
basaran GUI and API as a drop-in replacement of the OpenAI text completion API. Broad HF eco system support (not only llama)
CodeTF one-stop Python transformer-based library for code LLMs and code intelligence, training and inferencing on code summarization, translation, code generation

Model agnostic


LangChain | framework for developing LLM applications (example, paolorechia/learn-langchain with vicuna and GPTQ 4 bit support)
LangFlow GUI for Langchain
Toolformer implementation Allows LLMs to use Tools
megabots to create LLM bots by providing Q&A, document retrieval, vector DBs, FastAPI, Gradio UI, GPTCache, guardrails, whisper, supports OpenAI API (local LLMs planned)
Jsonformer: Generate Structured JSON from Language Models by handling JSON synthax, and letting LLM just output the values
gorilla Enables LLMs to use tools by semantically and syntactically correctly invoking APIs. Reduces hallucination, custom trained model weights based on llama-7b

Fine Tuning & Training


simple llama finetuner
LLaMA-LoRA Tuner
alpaca-lora
StackLLaMA Fine-Tuning Guide by huggingface
xTuring LLM finetuning pipeline supporting LoRa & 4bit
Microsoft DeepSpeed Chat
How to train your LLMs
H2O LLM Studio | Framework and no-code GUI for fine tuning SOTA LLMs
Implementation of LLaMA-Adapter, to fine tune instructions within hours
Hivemind Training at home
Axolotl a llama, pythia, cerebras training environment optimized for Runpod supporting qlora, 4bit, flash attention, xformers
LMFlow toolbox for finetuning, designed to be user-friendly, speedy, and reliable
qlora uses bitsandbytes quantization and PEFT and transformers for efficient finetuning of quantized LLMs
GPTQlora Efficient Finetuning of Quantized LLMs with GPTQ QLoRA and AutoGPTQ for quantization

Frameworks


Vicuna FastChat
SynapseML (previously known as MMLSpark),an open-source library that simplifies the creation of massively scalable machine learning (ML) pipelines
Microsoft guidance efficient Framework for Enhancing Control and Structure in Modern Language Model Interactions. Demo project by paolorechia for local text-generation-webui. reddit thread. guidance fork and llama-cpp-python fork how-to on reddit
Microsoft semantic-kernel a lightweight SDK enabling integration of AI Large Language Models (LLMs) with conventional programming languages
Colossal-AI unified deep learning system that provides a collection of parallel components for distributed deep learning models. Provides data parallelism, pipeline parallelism, and tensor parallelism

Resources

Data sets


Alpaca-lora instruction finetuned using Low Rank Adaption
codealpaca Instruction training data set for code generation
LAION AI / Open-Assistant Dataset (https://github.com/LAION-AI/Open-Assistant / https://projects.laion.ai/Open-Assistant/ / https://open-assistant.io)
ShareGPT pre-cleaned, English only "unfiltered," and 2048 token split version of the ShareGPT dataset ready for finetuning
Vicuna ShareGPT pre-cleaned 90k conversation dataset
Vicuna ShareGPT unfiltered
GPTeacher
alpaca-cleaned
codealpaca 20k
gpt3all pruned
gpt4all_prompt_generations_with_p3
gpt4all_prompt_generations
alpaca-plus-gpt4all-without-p3
Alpaca dataset from Stanford, cleaned and curated
Alpaca Chain of Thought fine tuning dataset for EN and CN
PRESTO paper Multilingual dataset for parsing realistic task-oriented dialogues by Google & University of Rochester, California, Santa Barbara, Columbia
RedPajama Dataset and model similar to LLaMA but truly open source and ready for commercial use. hf
BigCode The Stack
open-instruct-v1
awesome-instruction-dataset list of instruction datasets by yadongC
The Embedding Archives Millions of Wikipedia Article Embeddings in multiple languages
Rereplit-finetuned-v1-3b & replit-code-v1-3b outperforming all coding OSS models, gets released soon
alpaca_evol_instruct_70k an instruction-following dataset created using Evol-Instruct, used to fine-tune WizardLM
gpt4tools_71k.json from GPT4Tools paper, having 71k instruction-following examples for sound/visual/text instructions
WizardVicuna 70k dataset used to fine tune WizardVicuna
Numbers every LLM Developer should know
airoboros uncensored
CoT collection, paper
airoboros-gpt4 fine-tuning dataset optimized for trivia, math, coding, closed context question answering, multiple choice, writing
fin-llama a LLaMA finetuned for finance, code, model

Research


LLM Model Cards
GPTs are GPTs: An early look at the labor market impact potential of LLMs
ViperGPT Visual Inference via Python Execution for reasoning
Emergent Abilities of LLMs , blog post
facts checker reinforcement
LLaVA: Large Language and Vision Assistant, combining LLaMA with a visual model. Delta-weights released
Mass Editing Memory in a Transformer
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
WizardLM | Fine tuned LLaMA 7B with evolving instructions, outperforming chatGPT and Vicuna 13B on complex test instructions (code, delta weights)
Scaling Transformer to 1M tokens and beyond with RMT
AudioGPT | Understanding and Generating Speech, Music, Sound, and Talking Head (github, hf space)
Chameleon-llm, a paper about Plug-and-Play Compositional Reasoning with GPT-4
GPT-4-LLM share data generated by GPT-4 for building an instruction-following LLMs with supervised learning and reinforcement learning. paper
GPT4Tools Teaching LLM to Use Tools via Self-instruct. code
CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society. preprint paper, website
Poisoning Language Models During Instruction Tuning
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
Dromedary: Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision, code, weights
Unlimiformer: transformer-based model that can process unlimited length input by offloading attention computation to a k-nearest-neighbor index, extending the capabilities of existing models like BART and Longformer without additional weights or code modifications. code
Salesforce LAVIS provides a comprehensive Python library for language-vision intelligence research, including state-of-the-art models like BLIP-2 for vision-language pretraining and Img2LLM-VQA for visual question answering, alongside a unified interface
FLARE an active retrieval augmented generation technique that iteratively predicts, retrieves, and refines content, improving the accuracy and efficiency of long-form text generation in language models
Hyena a subquadratic-time layer that has the potential to significantly increase context length in sequence models, using a combination of long convolutions and gating. Long Convs and Hyena implementations
FastServe an efficient distributed inference serving system for LLMs that minimizes job completion time using preemptive scheduling and efficient GPU memory management, built on NVIDIA FasterTransformer.
FrugalGPT is a model that uses LLM cascade to optimize the performance and cost-efficiency of LLMs like GPT-4.
Landmark Attention LLaMa 7B with 32k tokens. Code, llama7b diff weights, merged llama7b weights
QLORA Efficient Finetuning of Quantized LLMs
Tree of Thoughts (ToT) Enables exploration over text, improves strategic decision-making in language models. Code. Example implementation, discussion
MEGABYTE Efficient multiscale decoder architecture for long-sequence modeling.
PandaGPT:  project page, code, model combines ImageBind and Vicuna to understand and combine multimodal inputs from text, image, audio, depth, thermal, and IMU.
LIMA Less Is More for Alignment. Shows fine-tuning with 1000 carefully curated prompts without reinforcement learning can outperforms GPT-4 in many cases
Gorilla a finetuned LLaMA-based model that surpasses GPT-4 in writing API calls and reduces hallucination. project, code
Voyager Open-Ended Embodied Minecraft Agent using LLMs, project, code
BigTrans llama adapted to multilingual translation over 100 languages, outperforming chatGPT in 8 language-pairs
BPT memory-efficient approach to processing long input sequences in Transformers
Lion efficiently transfers knowledge from a closed-source LLM to an open-source student model
Undetectable Watermarks for Language Models using one-way functions
ALiBi Train Short Test Long. Attention with Linear Biases Enables Input Length Extrapolation. code
The Curse of Recursion: Training on Generated Data Makes Models Forget
Brainformers a complex block for natural language processing that outperforms state-of-the-art Transformers in efficiency and quality
AWQ Activation aware Weight Quantization, code
SpQR quantization by Tim Dettmers, code, twitter

Other awesome resources


LLM Worksheet by randomfoo2
The full story of LLMs
Brief history of llama models
A timeline of transformer models
Every front-end GUI client for ChatGPT API
LLMSurvey a collection of papers and resources including an LLM timeline
rentry.org/lmg_models a list of llama derrivates and models
Timeline of AI and language models and Model Comparison Sheet by Dr. Alan D. Thompson
Brex's Prompt Engineering Guide an evolving manual providing historical context, strategies, guidelines, and safety recommendations for building programmatic systems on OpenAI's GPT-4
LLMs Practical Guide actively curated collection of a timeline and guides for LLMs, providing a historical context and restrictions based on this paper and community contributions
LLMSurvey based on this paper, builds a collection of further papers and resources related to LLMs including a timeline
LLaMAindex can now use Document Summary Index for better QA performance compared to vectorDBs
ossinsight.io chat-gpt-apps Updated list of top chatGPT related repositories

Product Showcases


Opinionate.io AI Debating AI
phind.com Developer Search Engine
Voice Q&A Assistant using ChatGPT API, Embeddings, Gradio, Eleven Labs and Whisper
chatpdf, Q&A for PDFs

Optimization


https://github.com/bigscience-workshop/petals
https://github.com/chavinlo/distributed-diffusion
https://github.com/VoltaML/voltaML-fast-stable-diffusion
https://github.com/FMInference/FlexGen
https://github.com/alpa-projects/alpa
https://github.com/kir-gadjello/zipslicer
https://github.com/modular-ml/wrapyfi-examples_llama
https://github.com/tloen/llama-int8
4 bits quantization of LLaMa using GPTQ (discussion)
https://petals.ml/
https://github.com/facebookincubator/AITemplate
https://github.com/HazyResearch/flash-attention
https://github.com/stochasticai/x-stable-diffusion
ExLlama a more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. By ReturningTarzan
tokenmonster ungreedy tokenizer increases inference speed and context-length by 35% for pre-training on new LLMs

Benchmarking

Leaderboards


Open LLM Leaderboard by HuggingFaceH4
LMSys Chatbot Arena Leaderboard, blogpost is an anonymous benchmark platform for LLMs that features randomized battles in a crowdsourced manner
Current best choices on LocalLLaMA reddit
LLM Logic Tests by YearZero on reddit/localllama
paperswithcode has LLM SOTA leaderboards, but usually just for foundation models
Can AI code a self-evaluating interview for AI coding models. code
Gotzmann LLM Score v2 by Gatzuma on Reddit
Aviary Explorer open source utility to compare leading OSS LLMs and see votes, pricing per token etc.
Comparative look at (ggml) quantization and parameter size part 1 by KerfuffleV2
Updated relative comparison of ggml quantization types and effect on perplexity part 2 by KerfuffleV2
Programming performance ranking for popular LLaMAs using HumanEval+ by ProfessionalHand9945
llm-humaneval-benchmarks HumanEval+
CoT Hub
C-Eval Benchmark
programming eval by catid from reddit, code

Benchmark Suites


Big-bench a collaborative benchmark featuring over 200 tasks for evaluating the capabilities of llms
Pythia interpretability analysis for autoregressive transformers during training
AlpacaEval automatic evaluation for instruction following LLMs, validated against 20k human annotations, reddit announcement

AI DevOps


https://www.steamship.com/

Databases for ML


Pinecone proprietary vector search for semantic search, recommendations and information retrieval
FAISS Library for Efficient Similarity Search and Clustering using vectors
Weaviate open source vector DB for services like OpenAI, HF etc for text, image, Q&A etc.
vespa.ai one of the only scalable vector DBs that supports multiple vectors per schema field
LanceDB free open-source serverless vector DB with support for langchain, llamaindex and multi-modal data
Deeplake Vector Database for audio, text, vectors, video
milvus open-source cloud-native vector DB focusing on embedding vectors converted from unstructured data
chroma open-source embedding database