As I have only 4 GB of VRAM, I am thinking of running Whisper on the GPU and Ollama on the CPU. I want to use the Mistral model, but create a LoRA to act as an assistant that primarily references data I've supplied during training. How do I force Ollama to stop using the GPU and only use the CPU?
Alternatively, is there any way to force Ollama to not use VRAM?
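A sketch of the split described above, not taken from the posts; it assumes the stock openai-whisper CLI and Ollama's documented behavior of ignoring GPUs when given an invalid CUDA device ID. Model names and file paths are placeholders.

    # Pin Whisper to the 4 GB card (the openai-whisper CLI accepts --device):
    whisper clip.wav --model small --device cuda

    # Keep Ollama off the GPU: an invalid CUDA device ID hides it from the server.
    CUDA_VISIBLE_DEVICES=-1 ollama serve

    # Per-model alternative: offload zero layers via a Modelfile.
    #   FROM mistral
    #   PARAMETER num_gpu 0
    ollama create mistral-cpu -f ./Modelfile

With num_gpu 0, no layers are placed in VRAM at all, which also covers the alternative question above.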
To get rid of the model, I needed to install Ollama again and then run ollama rm llama2.
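For reference, the cleanup itself is just two commands once the install is healthy (a sketch, using the llama2 tag from the post):

    ollama list       # show which models are on disk
    ollama rm llama2  # delete the llama2 weights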
Hey guys, I am mainly using my models through Ollama, and I am looking for suggestions when it comes to uncensored models I can use with it. Since there are a lot already, I feel a bit overwhelmed. I'm currently downloading Mixtral 8x22B via torrent; until now, I've always run ollama run somemodel:xb (or pull), so importing a download by hand is sketched below.
So once those >200 GB of glorious… Yes, I was able to run it on an RPi. Mistral and some of the smaller models work. LLaVA takes a bit of time, but works.
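A hedged sketch of pointing Ollama at weights downloaded outside of ollama pull; the GGUF file name here is hypothetical:

    # Modelfile contents:
    #   FROM ./mixtral-8x22b.Q4_K_M.gguf
    ollama create mixtral-8x22b -f ./Modelfile
    ollama run mixtral-8x22b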
For text to speech, you'll have to run an API from ElevenLabs, for example.
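If you go that route, it is a single HTTP call; this sketch targets ElevenLabs' public text-to-speech endpoint, with the voice ID and API key as placeholders:

    curl -X POST "https://api.elevenlabs.io/v1/text-to-speech/VOICE_ID" \
      -H "xi-api-key: $ELEVENLABS_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{"text": "Hello from the assistant"}' \
      --output speech.mp3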
I haven't found a fast text-to-speech, speech-to-text combination that's fully open source yet. If you find one, please keep us in the loop. I'm running Ollama on an Ubuntu server with an AMD Threadripper CPU and a single GeForce 4070. I have two more PCIe slots and was wondering if there was any advantage to adding additional GPUs.
Does Ollama even support that, and if so, do they need to be identical GPUs? How to make Ollama faster with an integrated GPU: I decided to try out Ollama after watching a YouTube video. The ability to run LLMs locally and get fast output amused me.
But after setting it up on my Debian system, I was pretty disappointed.
I downloaded the CodeLlama model to test. I asked it to write a C++ function to find primes. OK, so Ollama doesn't have a stop or exit command; we have to manually kill the process.
And this is not very useful, especially because the server respawns immediately, so there should be a stop command as well. Yes, I know and use these commands, but they are all system commands that vary from OS to OS.
I am talking about a single command.
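A sketch of the usual workarounds, hedged: the keep_alive trick comes from Ollama's HTTP API and is OS-independent, while the service command assumes a systemd install; newer releases also added an ollama stop MODEL subcommand.

    # Unload a model's weights without killing the server:
    curl http://localhost:11434/api/generate -d '{"model": "codellama", "keep_alive": 0}'

    # Stop the server itself on a systemd distro, so it does not respawn:
    sudo systemctl stop ollama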
I've just installed Ollama on my system and chatted with it a little. Unfortunately, the response time is very slow even for lightweight models like… I'm using Ollama to run my models.
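For slow responses like this, one quick check is whether the model actually landed on the GPU at all; this assumes a recent build that ships the ps subcommand:

    ollama ps    # the PROCESSOR column shows the CPU/GPU split per loaded model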