Back in May, I got Vicuna 7B — a chat-tuned version of the original Llama model, working entirely in the browser via the new WebGPU APIs that had shipped in Chrome.
Hey Matt, I used Thiggle and it is awesome! One request: Structured API is the most exciting part for me and I am used to pay-as-you-go + monitoring dashboards for myself in LLM workflows. It would be cool to see a pay-as-you-go option for the structured API.
Hey Matt, I used Thiggle and it is awesome! One request: Structured API is the most exciting part for me and I am used to pay-as-you-go + monitoring dashboards for myself in LLM workflows. It would be cool to see a pay-as-you-go option for the structured API.