| 1. | | The Smol Training Playbook: The Secrets to Building World-Class LLMs (huggingface.co) |
| 265 points by kashifr 12 days ago | past | 19 comments |
|
| 2. | | Unlocking On-Policy Distillation for Any Model Family (huggingface.co) |
| 6 points by kashifr 13 days ago | past | 1 comment |
|
| 3. | | Transformers 4.55 New OpenAI GPT OSS (github.com/huggingface) |
| 2 points by kashifr 3 months ago | past | 1 comment |
|
| 4. | | SmolLM3: Smol, multilingual, long-context reasoner LLM (huggingface.co) |
| 388 points by kashifr 4 months ago | past | 79 comments |
|
| 5. | | Epic vs. Apple (twitter.com/dhh) |
| 7 points by kashifr 6 months ago | past |
|
| 6. | | AIMO (AI Math Olympiad) progress prize winning solution (huggingface.co) |
| 9 points by kashifr on July 10, 2024 | past |
|
| 7. | | MaPO: A reference-free alignment technique for diffusion models (mapo-t2i.github.io) |
| 2 points by kashifr on June 11, 2024 | past | 1 comment |
|
| 8. | | OpenHermesPreferences: Dataset of ~1M AI preferences from teknium/OpenHermes-2.5 (huggingface.co) |
| 7 points by kashifr on Feb 26, 2024 | past | 1 comment |
|
| 9. | | HuggingFace Training Cluster as a Service (huggingface.co) |
| 101 points by kashifr on Sept 5, 2023 | past | 45 comments |
|
| 10. | | HuggingFace $235M Series D at a $4.5B valuation (twitter.com/clementdelangue) |
| 3 points by kashifr on Aug 24, 2023 | past |
|
| 11. | | Fine-tune Llama 2 with DPO (huggingface.co) |
| 3 points by kashifr on Aug 8, 2023 | past |
|
| 12. | | QLoRA 4-bit finetuning of LLMs (github.com/artidoro) |
| 7 points by kashifr on May 24, 2023 | past | 1 comment |
|
| 13. | | StackLLaMA: A hands-on guide to train LLaMA with RLHF (huggingface.co) |
| 165 points by kashifr on April 6, 2023 | past | 38 comments |
|
| 14. | | HuggingFace Diffusers 0.2 with Stable Diffusion pipeline (github.com/huggingface) |
| 2 points by kashifr on Aug 16, 2022 | past | 1 comment |
|
| 15. | | Diffusers: Modular Diffusion model library from HuggingFace (github.com/huggingface) |
| 47 points by kashifr on July 21, 2022 | past | 5 comments |
|
| 16. | | Generic Neural Elastic Search (gnes.ai) |
| 4 points by kashifr on July 26, 2019 | past |
|
| 17. | | PyTorch 1.0 is out (github.com/pytorch) |
| 470 points by kashifr on Dec 7, 2018 | past | 70 comments |
|
| 18. | | PyTorch 0.4.0 is out (github.com/pytorch) |
| 36 points by kashifr on April 24, 2018 | past | 7 comments |
|
| 19. | | An MNIST-like fashion product dataset (github.com/zalandoresearch) |
| 220 points by kashifr on Aug 28, 2017 | past | 21 comments |
|
| 20. | | AMS Sketch Algorithm (2014) (fu-berlin.de) |
| 7 points by kashifr on Feb 16, 2015 | past |
|
| 21. | | Morris Algorithm (2014) (fu-berlin.de) |
| 43 points by kashifr on Feb 9, 2015 | past | 4 comments |
|
| 22. | | Show HN: etcml easy text classification with machine learning service (etcml.com) |
| 4 points by kashifr on Dec 3, 2013 | past |
|
| 23. | | Async. & Realtime Geo Applications with Node.js (video) (fosslc.org) |
| 4 points by kashifr on Nov 4, 2011 | past |
|