Cerebras Systems today announced what it said is record-breaking performance for DeepSeek-R1-Distill-Llama-70B inference, achieving more than 1,500 tokens per second – 57 times faster than GPU-based solutions. Cerebras said this speed enables instant reasoning capabilities for one of the industry’s most sophisticated …

Cerebras Reports Fastest DeepSeek R1 Distill Llama 70B Inference Read more »

https://orionx.net/wp-content/uploads/2025/02/HPCNB_20250203.mp3 Happy February to one and all! The HPC-AI world was upended last week by AI benchmark numbers from DeepSeek, as the dust settles we offer a brief  (8:12) commentary on what, at this stage, it may mean: – Five …

News Bytes Podcast 20250203: DeepSeek Lessons, Intel Reroutes GPU Roadmap, LLNL and OpenAI for National Security, Nuclear Reactors for Google Data Centers Read more »