Phyo Arkar Lwin v3ss0n

## llama-home.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                v3ss0n
                / llama-home.md
            
            
              Created
              May 14, 2023 13:53
                — forked from rain-1/llama-home.md
            
              
                How to run Llama 13B with a 6GB graphics card
              
          
    This worked on 14/May/23. The instructions will probably require updating in the future.

llama is a text prediction model similar to GPT-2, and the version of GPT-3 that has not been fine tuned yet.
It is also possible to run fine tuned versions (like alpaca or vicuna with this. I think. Those versions are more focused on answering questions)

It is possible to run LLama 13B with a 6GB graphics card now! (e.g. a RTX 2060). Thanks to the amazing work involved in llama.cpp. The latest change is CUDA/cuBLAS which allows you pick an arbitrary number of the transformer layers to be run on the GPU. This is perfect for low VRAM.

Clone llama.cpp from git, I am on commit 08737ef720f0510c7ec2aa84d7f70c691073c35d.

git clone https://github.com/ggerganov/llama.cpp.git


cd llama.cpp


## controllers.application.js
import Ember from 'ember';

export default Ember.Controller.extend({
  appName: 'Ember Twiddle',
  allInvoices: Ember.computed(function() {
    return this.store.peekAll('invoice');
  })
});

## application.controller.js
import Ember from 'ember';

export default Ember.Controller.extend({
  appName:'Ember Twiddle'
});
	import Ember from 'ember';

	export default Ember.Controller.extend({
	appName: 'Ember Twiddle',
	allInvoices: Ember.computed(function() {
	return this.store.peekAll('invoice');
	})
	});
	import Ember from 'ember';

	export default Ember.Controller.extend({
	appName:'Ember Twiddle'
	});