Working on AI models? Here’s how much data you’ll need to tune them up – Portland Oregon startups, tech, news, events, jobs, and community

While any number of AI solutions are leveraging existing Large Language Models, there are still a number of startup folks creating their own models — when they can find the compute to do so. But what does it take to create those models and how does one fine tune them so that they behave as expected? Portland serial founder Devin Gaffney shared his thoughts.

To “make” a model a “specialist”, we engage in a “finetuning” procedure. Lots of companies provide this as an extension of their core models, but we can also just as easily do it with offline models. When we “fine-tune” a model, we are, in essence, teaching a machine to shift its attentional dimensions to the sub-territory of linguistic space on a “map” – we’re doing the model equivalent of zooming into a map to consider the detail we actually care about.

It’s still early days with LLMs, relatively speaking. It’s become super easy to deploy models, run them offline, and play with their output, but there’s surprisingly few rigorous, concrete results when asking a question like “How much data is enough for finetuning an LLM, though?”

For Devin’s thoughts, read “How Much Data is Enough for Finetuning an LLM?“

More Portland startup news

Category: Portland

Published by Rick Turoczy

More than mildly obsessed with the Portland startup community. Founder and editor at Silicon Florist. Cofounder and general manager at PIE. Follow me on Twitter: @turoczy View all posts by Rick Turoczy

More Portland startup news

Published by Rick Turoczy

Related

Discover more from Portland Oregon startups, tech, news, events, jobs, and community