Refactoring using local AI solutions
Hi! Would it be possible, at some point, to use not only an online AI interface but also a local one, using LMStudio or GPT4All, for example?
It's something we plan to look into, but it's not available yet. It will be interesting to compare results from a local solution against OpenAI's. Hopefully it'll work well.
Unfortunately, it looks like the models we can run locally perform terribly compared with gpt-4o via the OpenAI API. I tried Meta-Llama-3-8B-Instruct.Q4_0.gguf and gpt4all-13b-snoozy-q4_0.gguf via gpt4all. Cleaning up the results was more work than writing the code myself, and they exhibited many of the problems I remember encountering with OpenAI models about two years ago. So for now we're deferring further work on running models locally. We'll revisit this, of course, as things advance.
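In case anyone wants to reproduce this kind of local test, here's a minimal sketch using the gpt4all Python bindings (pip install gpt4all); the model file name comes from the post above, while the prompt and generation settings are placeholders:

```python
from gpt4all import GPT4All

# Load the quantized model from the local model directory,
# downloading it first if it isn't already there.
model = GPT4All("Meta-Llama-3-8B-Instruct.Q4_0.gguf")

# A chat session keeps conversation context between calls.
with model.chat_session():
    reply = model.generate(
        "Refactor this Python function to remove duplication: ...",  # placeholder prompt
        max_tokens=512,
    )
    print(reply)
```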
Have to agree - I retried a test task I had used half a year ago with Claude, where the experience was excellent, this time with qwen2.5-coder-7b-instruct-Q4_K_M.gguf in LMStudio. It pointed out the obvious problems but still produced mediocre results. I also tried smaller models, but those couldn't produce a single line of Python code. I didn't try larger models, as my computer doesn't have the resources to run them.
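For anyone who wants to retry this setup, here's a minimal sketch of talking to whatever model LMStudio has loaded through its OpenAI-compatible local server; the port is LMStudio's default, and the model identifier and prompt are placeholders:

```python
from openai import OpenAI

# LMStudio's local server speaks the OpenAI API; it ignores the key,
# but the client library requires a non-empty one.
client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

response = client.chat.completions.create(
    model="qwen2.5-coder-7b-instruct",  # use the identifier LMStudio shows for the loaded model
    messages=[
        {"role": "user", "content": "Refactor this Python function: ..."},  # placeholder
    ],
)
print(response.choices[0].message.content)
```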
I tried this on a very capable machine with 12 CPU cores, 19 GPU cores, and 16 neural engine cores, although I have no idea whether gpt4all manages to use any of that hardware. It wasn't terribly fast, so perhaps not, but I suspect the poor results are down to the model rather than the computing hardware. It often stopped early, saying things like "fill in the rest of the implementation here", which OpenAI's models did a lot a while back before that was explicitly fixed. I'm hopeful that locally run models will eventually catch up...
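On the hardware question: the gpt4all Python bindings do take a device argument for GPU offload; whether the desktop app does the same by default I don't know, so treat this sketch as an assumption:

```python
from gpt4all import GPT4All

# Request GPU offload; "gpu" picks the best available backend
# (Metal on Apple silicon), though exact device names vary by build.
model = GPT4All("Meta-Llama-3-8B-Instruct.Q4_0.gguf", device="gpu")
print(model.generate("Say hello.", max_tokens=20))
```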