Lajos Bencz lajosbencz

## code-assist.md

      
        
          
            
              
              1 file
            
          
          
            
              
              3 forks
            
          
          
            
              
              5 comments
            
          
          
            
              
              32 stars
            
          
        
        
          
              
          
          
            
                Birch-san
                / code-assist.md
            
            
              Last active
              March 4, 2024 19:32
            
              
                Local VSCode AI code assistance via starcoder + 4-bit quantization in ~11GB VRAM
              
          
        
      
        
  
      
    Install HF Code Autocomplete VSCode plugin.
We are not going to set an API token. We are going to specify an API endpoint.

We will try to deploy that API ourselves, to use our own GPU to provide the code assistance.
We will use bigcode/starcoder, a 15.5B param model.

We will use NF4 4-bit quantization to fit this into 10787MiB VRAM.

It would require 23767MiB VRAM unquantized. (still fits on a 4090, which has 24564MiB)!
Setup API