It's live on openrouter now. In my personal benchmark it's bad. So far the bench...

manofmanysmiles · 2026-02-11T18:04:34 1770833074

I love the idea of chat.md.

I'm developing a personal text editor with vim keybindings and paused work because I couldn't think of a good interface that felt right. This could be it.

I think I'll update my editor to do something like this but with intelligent "collapsing" of extra text to reduce visual noise.

pcwelder · 2026-02-12T10:24:45 1770891885

Cool! Please share your work if possible!

I couldn't decide on folding and reducing noise so I'm stuck on that front. I believe there is some elegant solution that I'm missing, hope to see your take.

data-ottawa · 2026-02-11T18:29:28 1770834568

Custom tool calling formats are iffy in my experience. The models are all reinforcement learned to follow specific ones, so it’s always a battle and feels to me like using the tool wrong.

Have you had good results with the other frontier models?

thegeomaster · 2026-02-11T23:35:35 1770852935

Not the parent commenter, but in my testing, all recent Claudes (4.5 onward) and the Gemini 3 series have been pretty much flawless in custom tool call formats.

data-ottawa · 2026-02-11T23:57:04 1770854224

Thanks.

I’ve tested local models from Qwen, GLM, and Devstral families.

pcwelder · 2026-02-12T07:16:28 1770880588

All anthropic models. Gemini 2.5 pro and above. Gemini 3 flash is very good too.

GPT models can follow tool format correctly but don't keep on going.

Grok-4+ are decent but with issues in longer chats.

Kimi 2.5 has issues with it reverting to its RL tool format.

nolist_policy · 2026-02-11T18:08:05 1770833285

Could also be the provider that is bad. Happens way too often on OpenRouter.

pcwelder · 2026-02-11T18:12:01 1770833521

I had added z-ai in allow list explicitly and verified that it's the one being used.

sergiotapia · 2026-02-11T18:25:30 1770834330

Be careful with openrouter. They routinely host quantized versions of models via their listed providers and the models just suck because of that. Use the original providers only.

nullbyte · 2026-02-11T20:04:17 1770840257

I specifically do not use the CN/SG based original provider simply because I don't want my personal data traveling across the pacific. I try to only stay on US providers. Openrouter shows you what the quantization of each provider is, so you can choose a domestic one that's FP8 if you want

sschueller · 2026-02-12T05:28:48 1770874128

Funny, living in Europe, I prefer using EU and Chinese hosts because as I don't want my data going to the US.

The trust in US firms and state is completely gone.

mycall · 2026-02-13T18:49:55 1771008595

Living in the US, my trust in US firms and state is also completely gone. My only hope is local LLMs.

lostmsu · 2026-02-12T10:52:53 1770893573

Tangent note: this sounds like the same mistake as EU's reliance on Russia.

lossolo · 2026-02-12T11:31:05 1770895865

Not really. China doesn't share a border with us, doesn't claim any EU territory, and didn't historically rule our lands the way the USSR did. In the context of spheres of influence and security interests, its strategic goals aren't directly at odds with the EU's core interests.

lostmsu · 2026-02-12T14:49:21 1770907761

EU is not a singular country, and Germany or France don't border Russia either.

Considering China is ok to supply Russia, I don't see how your second point has any standing either.

lossolo · 2026-02-12T15:33:41 1770910421

> EU is not a singular country, and Germany or France don't border Russia either.

But soon they could, that's the problem.

> Considering China is ok to supply Russia, I don't see how your second point has any standing either.

Supply? China supplies Ukraine too. Ukraine's drone sector runs heavily on Chinese supply chains. And if China really wanted to supply Russia, the war would likely be over by now, Russia would have taken all of Ukraine.