@simon Any clue why your #LLM tool with gpt4all's ggml-replit-code-v1-3b would perform worse than this replicate demo?
Is there a need to tweak the parameters for the model somewhere maybe?
https://replicate.com/replit/replit-code-v1-3b?prediction=zarihvjb2xfluvwsplgye4bude
@mauve worse in terms of speed or quality?
What operating system are you running?
@mauve looks like Replicate are running it on a A100 40GB, which is a ~$7,000 GPU!