Hacker News
Show HN: MaximusLLM – Train 262k-vocab LLMs on a single 16GB GPU (github.com/yousef-rafat)
2 points by yousef_g 18 days ago
Ghost Logits: Simulating missing partition mass in sampled softmax [pdf] (github.com/yousef-rafat)
1 point by yousef_g 19 days ago
Show HN: MaximusLLM, Breaking transformer's O(N^2) and O(V) scaling bottlenecks (github.com/yousef-rafat)
1 point by yousef_g 21 days ago
MaximusLLM: High-Speed Architecture via Ghost Logits and Random Latent Attention (github.com/yousef-rafat)
1 point by yousef_g 22 days ago
I have reimplemented Stable Diffusion 3.5 from scratch in pure PyTorch (github.com/yousef-rafat)
481 points by yousef_g 9 months ago | 77 comments
Magna: Embedding similarity search tool for searching within large documents (github.com/yousef-rafat)
14 points by yousef_g on Jan 5, 2025
RustyChat: Asynchronous local chat server written in Rust (github.com/yousef-rafat)
2 points by yousef_g on Dec 31, 2024
Open Source Twitter Bot (github.com/yousef-rafat)
4 points by yousef_g on Sept 15, 2024 | 1 comment
