Skip to content

Instantly share code, notes, and snippets.

@Jaid
Last active June 20, 2024 00:15
Show Gist options
  • Save Jaid/5ddee6cf20145da1b1b8745ef23b59e1 to your computer and use it in GitHub Desktop.
Save Jaid/5ddee6cf20145da1b1b8745ef23b59e1 to your computer and use it in GitHub Desktop.
Current best open-source AI models

Current best open-source AI models

Task Model Params (billions) Notes
Image Object Detection DETR-DC5 R101 0.607
Image Masking Segment Anything + ViT Huge 0.641
Image Depth Map Creation Depth Anything v2 Huge 1.3 not released yet, only Small to Large
Caption to Image Stable Diffusion XL 2.6
Caption to Video Open-Sora
Image to Video Stable Video Diffusion XT
Caption to Sound
Image to 3D InstantMesh taken from 3D Arena / maybe SV3D
Coding LLM (Instruct) DeepSeek Coder v2 Instruct 236
Code LLM (Completion / Filling holes) DeepSeek Coder v2 Base 236
General LLM (Instruct) Llama 3 70.6
General LLM (Completion / Filling holes) Roberta Large 0.355
Text to Speech ๐Ÿ‡บ๐Ÿ‡ธ Parler 0.6
Speech to Text ๐Ÿ‡บ๐Ÿ‡ธ Whisper Large v3 1.54
Text to Speech ๐Ÿ‡ฉ๐Ÿ‡ช
Speech to Text ๐Ÿ‡ฉ๐Ÿ‡ช
Speech to Language Lang ID
Speaking animation for portraits Hallo
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment