What models are you all using? #2673
Replies: 6 comments 15 replies
-
|
This is absolutely the type of discussions that I want to encourage on here. Also, keen to understand how people are developing with LLMs these days. I'll go first: For general, low-cost, low-impact conversations I use If I'm implementing a new feature (#2600), I enjoy using Claude Code to bounce ideas off and perhaps do an initial implementation. I've been using I also really like Gemini. I've used that quite a lot to create and edit the prompts in my prompt library. And occasionally use it for coding tasks to see how it performs. I will do more testing in the coming month. I've given up using self-hosted LLMs from Ollama. Never found their output to be useful and function calling still feels hit and miss. |
Beta Was this translation helpful? Give feedback.
-
|
I’m currently using GPT-5-mini via the OpenAI API for text generation |
Beta Was this translation helpful? Give feedback.
-
|
I mostly like to play around with local models. They aren't exactly building my next unicorn saas, but I just enjoy finding the limits. I have had the most success when I used CodeCompanion's inline mode (highlight a few lines, call |
Beta Was this translation helpful? Give feedback.
-
|
I have been using Qwen3-Coder-Next-IQ4_NL locally on a laptop with llamacpp (it requires about 46GB of VRAM) but I can actually get work done in chat mode. I have all the read tools enabled by default (local models shouldn't leak data) It's able to do edits and pair program with me, it's not Claude but it's great for local edits and scripts and clean up. List of things it has done:
I realize these are toy cases but it was able to call and use tools to do the operations by it self and it didn't completely mess everything up, which is new for local models. |
Beta Was this translation helpful? Give feedback.
-
|
I've been playing around with the local Ollama configuration (running qwen2.5 32b) and have noticed issues with the tool calling in the chat window. The inline code editor is somewhat spotty, if there is no code in the file it seems to struggle, but if I am simply asking it to edit an existing file it works fine. Can anyone describe their workflow with the local models using Ollama? I'm just trying to understand what it is, and is not capable of. (obviously agent mode does not work as evidenced by this thread) |
Beta Was this translation helpful? Give feedback.
-
|
I am playing with Gemma4:e4b on Raseon 780M with 16GB VRAM and 32GB RAM. llama.cpp acts as backend as ollama doesn't really support Gemma4 on Vulkan. My problem is that in this constellation such simple features as |
Beta Was this translation helpful? Give feedback.





Uh oh!
There was an error while loading. Please reload this page.
-
I know that this isn't directly related to codecompanion... But now that I hear claude is banning people and whatnot, I'm curious what LLM models you're all using at the moment?
Beta Was this translation helpful? Give feedback.
All reactions