| | Ggwave: Tiny Data-over-Sound Library (github.com/ggerganov) |
| 284 points by LorenDB 9 months ago | past | 72 comments |
|
| | Whisper.cpp: Looking for Maintainers (github.com/ggerganov) |
| 3 points by tech234a 9 months ago | past |
|
| | Llama.cpp now supports tool calling (OpenAI-compatible) (github.com/ggerganov) |
| 3 points by ochafik 9 months ago | past | 1 comment |
|
| | Ollama are 'try[ing to] achieve vendor lock-in' (github.com/ggerganov) |
| 17 points by alexmorley 9 months ago | past | 5 comments |
|
| | Ggml: X2 speed for WASM by optimizing SIMD (github.com/ggerganov) |
| 1 point by btilly 9 months ago | past | 2 comments |
|
| | DeepSeek-R1 speeds up llama.cpp code by x2 (github.com/ggerganov) |
| 6 points by roboboffin 9 months ago | past | 3 comments |
|
| | Llama.cpp PR with 99% of code written by DeepSeek-R1 (github.com/ggerganov) |
| 4 points by zelag 9 months ago | past |
|
| | Ggml 2x WASM Speed with SIMD Optimization Using 99% DeepSeek-R1-Generated Code (github.com/ggerganov) |
| 7 points by bratao 9 months ago | past |
|
| | Train a Mnist VAE with C and CUDA (github.com/ggerganov) |
| 54 points by bssrdf 11 months ago | past | 2 comments |
|
| | Llama.cpp Now Supports Qwen2-VL (Vision Language Model) (github.com/ggerganov) |
| 155 points by BUFU 11 months ago | past | 50 comments |
|
| | Llama.vim: Plugin for Neovim (github.com/ggerganov) |
| 2 points by mariuz on Oct 22, 2024 | past |
|
| | Llama.vim: Plugin for Neovim (github.com/ggerganov) |
| 2 points by ibobev on Oct 21, 2024 | past |
|
| | Attention and final logit soft-capping, update scaling factor to Gemma2 (github.com/ggerganov) |
| 2 points by tosh on July 1, 2024 | past |
|
| | Distributed LLM Inference with Llama.cpp (github.com/ggerganov) |
| 3 points by tosh on May 24, 2024 | past |
|
| | New exponent functions that make SiLU and SoftMax 2x faster, at full accuracy (github.com/ggerganov) |
| 382 points by weinzierl on May 15, 2024 | past | 72 comments |
|
| | ggml: Add Flash Attention (github.com/ggerganov) |
| 2 points by tosh on May 13, 2024 | past |
|
| | Acoustic Keyboard Eavesdropping (github.com/ggerganov) |
| 1 point by behnamoh on May 11, 2024 | past |
|
| | llama.cpp bfloat16 support (github.com/ggerganov) |
| 2 points by indigodaddy on April 30, 2024 | past |
|
| | GGML Flash Attention support merged into llama.cpp (github.com/ggerganov) |
| 3 points by smcleod on April 30, 2024 | past | 1 comment |
|
| | Llama.cpp Working on Support for Llama3 (github.com/ggerganov) |
| 7 points by theolivenbaum on April 18, 2024 | past |
|
| | Llama.cpp: Improve CPU prompt eval speed (github.com/ggerganov) |
| 1 point by tosh on April 17, 2024 | past |
|
| | Llama.cpp: Mac Prebuilds (github.com/ggerganov) |
| 2 points by tosh on March 22, 2024 | past |
|
| | Grok-1 Support for Llama.cpp (github.com/ggerganov) |
| 11 points by schappim on March 22, 2024 | past | 2 comments |
|
| | Control Vectors have been added to llama.cpp (github.com/ggerganov) |
| 3 points by Der_Einzige on March 16, 2024 | past |
|
| | Gemma Is Added to Llama.cpp (github.com/ggerganov) |
| 17 points by behnamoh on Feb 21, 2024 | past |
|
| | Llama.cpp supports distributed inference across machines on a local network (github.com/ggerganov) |
| 3 points by behnamoh on Jan 27, 2024 | past |
|
| | Llama.cpp incoming backends: Vulkan, Kompute, SYCL (github.com/ggerganov) |
| 2 points by irusensei on Jan 27, 2024 | past |
|
| | Llama.cpp: Self-Extend Support (github.com/ggerganov) |
| 2 points by tosh on Jan 9, 2024 | past |
|
| | Llama.cpp: SOTA 2-bit quants (github.com/ggerganov) |
| 5 points by tosh on Jan 7, 2024 | past |
|
| | GGUF File Format (github.com/ggerganov) |
| 2 points by warkanlock on Dec 31, 2023 | past |
|