Sunday, 22 February 2026

New best story on Hacker News: Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU
358 by xaskasdf | 93 comments on Hacker News.
Hi everyone, I'm kinda involved in some retrogaming, and during some experiments I ran into the following question: "Would it be possible to run transformer models bypassing the CPU/RAM, connecting the GPU to the NVMe?" This project is the result of that question and some weekend vibecoding (the library repository is linked in the README as well). It seems to work, even on consumer GPUs, though it should work better on professional ones.

New best story on Hacker News: How Taalas “prints” LLM onto a chip?

How Taalas “prints” LLM onto a chip?
365 by beAroundHere | 215 comments on Hacker News.


New best story on Hacker News: Claws are now a new layer on top of LLM agents

Claws are now a new layer on top of LLM agents
376 by Cyphase | 844 comments on Hacker News.
https://ift.tt/R2nXAyL Related: https://ift.tt/e6tZF8n

New best story on Hacker News: Across the US, people are dismantling and destroying Flock surveillance cameras

Across the US, people are dismantling and destroying Flock surveillance cameras
445 by latexr | 263 comments on Hacker News.