Observation: If your blog post starts with a clearly AI-generated abstract, why would I be motivated to read the rest of it? It's unfortunately indistinguishable from AI slop.
I found this tool (repo2file) helpful for my workflows - quickly giving context for questions to my local LLM about my working (small) repo in the terminal.
Until I saw this post, I wasn't aware of any of those.
What makes his better? Since you're asking, I tried these and here's my verdict:
- [files-to-prompt](https://github.com/simonw/files-to-prompt) (from the GOAT simonw)
--> There's no option to specify files to include, must work backwards with ignore option
- [1filellm](https://github.com/jimmc414/1filellm)
--> Many dependencies and complicated setup(have to setup GitHub access token which I've never done)
It’s very strange. I went down this rabbit hole a few months ago and this HN comment (https://news.ycombinator.com/item?id=19828702) was the best summary I could find of the situation.
TLDR: archive.is doesn’t play nice with its DNS results and Cloudflare refuses to fix it on their end.
This is the fault of archive.is — they’re using an outdated DNS load balancing approach that is inherently not robust. They’re mad at Cloudflare specifically but the reality is that there are many other similar failure scenarios.
The mature thing to do would be to switch to something like anycast IP routing, which is robust against issues like this.
I suspect the “problem” isn’t actually that big a deal to begin with, and the archive folks are making a mountain out of a molehill for philosophical reasons.
In other words they hate Cloudflare and will find any excuse to start a fight.
I would say the value is definitely there, when you consider the top links on Hooli, I mean Google, are frequently malicious. Though the biggest thing to me for Kagi is the ability to uprank and downrank various domains, so I can ensure the results I am actually looking for tend to be on the top, and sites which don't cater content I need personally filter downwards.