The amazing thing I find with everything I ever try that isn’t OpenAI’s ChatGPT is that it’s all pretty useless at small scale. What do I mean by this? Well, having tried a few different models and tools that you can run locally (not connected to any “cloud” service), the results are pretty underwhelming. For example, with Tabby connected to VSCode (locally), you get “okay” results for pretty trivial shit. The moment you try to do anything interesting at all, you get either completely garbage code or copyright headers as suggestions.
I just don’t have the time or energy to waste hours of a weekend on what basically amounts to probabilistic models trained on complete garbage, generating or predicting in pretty awful ways.
I even spent a few hours today learning about a statistical model used for natural language processing (NLP) called the n-gram (or ngram), and ended up with pretty abysmal results. It all depends on the quality of the dataset, and then it can only predict what it has already seen.
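To make that limitation concrete, here’s a minimal bigram (2-gram) sketch in Python. The toy corpus and function names are my own, made up for illustration, not code from any of the tools above:

```python
from collections import Counter, defaultdict

def build_bigram_model(tokens):
    """Count, for each word, how often each following word appears."""
    model = defaultdict(Counter)
    for current, following in zip(tokens, tokens[1:]):
        model[current][following] += 1
    return model

def predict_next(model, word):
    """Return the most frequent continuation, or None if the word was never seen."""
    if word not in model:
        return None  # the model can only predict what it has seen
    return model[word].most_common(1)[0][0]

# Toy corpus: the model's predictions are only ever as good as this data.
corpus = "the cat sat on the mat the cat ate the fish".split()
model = build_bigram_model(corpus)

print(predict_next(model, "the"))  # "cat" (the most frequent continuation)
print(predict_next(model, "dog"))  # None ("dog" never appears, so nothing to predict)
```

Feed it a garbage corpus and you get garbage predictions; ask it about anything outside the corpus and you get nothing at all. That, in miniature, is the whole problem.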