More

yoeven · 2025-06-02T21:24:14 1748899454

This is awesome, I've been following time series forecasting a ton so build a zero-shot time series model that allows you to pass in historical data and provide a 98% accurate forecast compared to custom training your own model with custom dataset. https://jigsawstack.com/ai-prediction

if you find it interesting, you could add it to your articles too as a comparison :)

yoeven · 2025-05-21T03:30:57 1747798257

How do you solve the access control part? Did you build the layer ontop or more like a auth redirect?

JonanJ · 2025-05-21T17:18:17 1747847897

Using OAuth2.0 standard for authentication & authorization then using the access token from that to call our APIs to track usage!

yoeven · 2025-03-21T16:38:15 1742575095

I think you can try the model

yoeven · 2025-03-07T12:15:45 1741349745

I ran Mistral AI OCR against JigsawStack OCR and beat their model in every category. Full breakdown here: https://jigsawstack.com/blog/mistral-ocr-vs-jigsawstack-vocr

27theo · 2025-03-07T12:18:51 1741349931

Just a small fyi, as viewed on an iPhone in Safari your tables don’t allow horizontal scrolling, cutting off the right column

yoeven · 2025-02-27T18:13:36 1740680016

Looks great!! I would love to move to this as my go-to API client but my fear is it stops getting maintained overtime like httpie.io but at least this is open source so a great win!

yoeven · 2025-02-27T18:11:47 1740679907

Are there plans for future integrations into messaging platforms like Whatsapp or Telegram?

yoeven · 2025-02-27T18:09:37 1740679777

Cool stuff!

yoeven · 2025-02-02T15:38:58 1738510738

Multimodal is the ability to handle different type of inputs like images, pdf, text... You can do a quick google if you'd like to understand the meaning. Here is an article if it helps: https://www.splunk.com/en_us/blog/learn/multimodal-ai.html

yoeven · 2025-02-02T13:24:02 1738502642

I love the simplicity and wish there was more users to make this the true alternative

yoeven · 2025-02-02T13:13:02 1738501982

It's a framework that uses the best part of each LLM, e.g. multimodal support from gemini with tool calling from gpt-4o and reasoning from o3-mini by chaining them dynamically. From a user perspective, there is no model selection or routing, just write the prompt or upload a file and it works so it feels like you're working with a single LLM but under the hood it does all this work to get you the best output :) Sorry if you felt it's misleading but I hope you give it a shot!

danielbln · 2025-02-02T13:38:22 1738503502

The problem with that phrasing is that there is actual model merging, where you merge the weights. So people reading the title might (and apparently do) expect that, less so an LLM router.

vikramkr · 2025-02-02T15:37:41 1738510661

Makes sense but the problem is that you're using words that already have specific meanings in the space, all related to creating one model with multiple functionalities. Merging meaning merging models into one model. Multi modal meaning one llm that handles multiple modes. The term you want is probably agent or framework or chain or something. Basically, what you describe is when it feels like you're only working with one model. What your title says is when you engineer specifically actually only one model, which is a distinct technical challenge.

yoeven · 2025-02-02T15:42:30 1738510950

I 100% agree, this simulates a multimodal input and automatically handles the rest along with model selection by using a variation of techniques. It doesn't do this natively on the model level

OutOfHere · 2025-02-02T17:21:48 1738516908

You are still not getting it. The use of the word multimodal does nothing good for your software. It is an LLM router. I get it that your software does support some multimodal LLMs, but that is incidental.

Secondly, the use of the word "merging" is also grossly misleading. You are not merging LLMs, only routing requests.