So many tasks are classification tasks in disguise. Answering multiple-choice questions Sentiment analysis Choosing the best tool(s) for an AI agent Labeling training data LLMs are excellent reasoning engines. But nudging them to the desired output is challenging. They might return categories outside the ones that you determined. They might return multiple categories when you only want one (or the opposite — a single category when you want multiple). Even if you steer the AI toward the correct answer, parsing the output can be difficult. Asking the LLM to output structure data works 80% of the time. But the 20% of the time that your code parses the response fails takes up 99% of your time and is unacceptable for most real-world use cases.
Categorization and Classification with LLMs
Hey Matt, just curious - are we seeing a new product/company from you or is this just a project?
Can you speak regarding your technical approach to data structuredness server-side here (or is it secret)?