Voozh

SemiAnalysis

2,493 posts

👁 Image

👁 user avatar

SemiAnalysis

@SemiAnalysis_

Joined January 2024

👁 user avatar
SemiAnalysis
@SemiAnalysis_
36m
Great work to @vllm_project team and @nvidia on smooth, out-of-the-box day 0 @MiniMax_AI M3 experience with @inferact EAGLE3 spec decode. Here are the details of ongoing M3 workstream: NVIDIA, Inferact and SemiAnalysis are working hard on enabling disaggregated inferencing (PR
👁 user avatar
NVIDIA AI Infrastructure
👁 NVIDIA
@NVIDIAAIInfra
5h
📣: MiniMax M3 has landed, joining models like DeepSeek V4 and Kimi-K2.6 at the frontier of open agentic models — and NVIDIA Blackwell is already delivering leading performance on it. NVIDIA Blackwell Ultra delivers up to 5x higher AI factory throughput than NVIDIA Hopper on
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
1h
Wide Expert Parallelism increases the total memory bandwidth available per MoE deployment. This means the model distributes the MoE expert weights across multiple GPUs, so each GPU only needs to load a tiny fraction of the weights. This translates to higher throughput per GPU,
👁 Image
00:00
👁 user avatar
SemiAnalysis
@SemiAnalysis_
5h
The US has imported more from Taiwan than from China since November 2025. That headline means both more and less than it appears. A thread on why AI infrastructure has made trade accounting genuinely hard. (1/6)🧵
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
5h
Replying to @SemiAnalysis_
GDPNow and Street economists are presumably add-factoring this, but GDP forecast misses have been notably large the last few years (tariffs and oil share the blame). The iPhone always had this value-decomposition issue — it was just too small to move headline GDP. AI imports
👁 user avatar
SemiAnalysis
@SemiAnalysis_
5h
Separately: oil. The Iran war price spike has inflated nominal US petroleum exports. No GDP impact — the investment response is missing from rig counts — but an enormous dollar flow to US asset holders at these prices. (6/6)
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
19h
Offer me Money, Offer me Power. I don't care... I AM BACK!!!
👁 Image
00:00
👁 user avatar
SemiAnalysis
@SemiAnalysis_
23h
Analyzing Internal SemiAnalysis usage, Claude still mogs for coding & deep research. Even though Codex has a better Desktop app UI, Claude still has better adoption.
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 16
RL Systems Mind the Gap: Matching Trainer and Generator Throughput RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker
👁 Image
RL Systems Mind the Gap: Matching Trainer and Generator Throughput
From newsletter.semianalysis.com
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 16
ALERT: OpenAI's CFO claims their next big training run will happen in Fall 2026 on Vera Rubin but that doesn't add up. Rubin NVL72 clusters likely won't be stable enough by then, and the software stack won't be mature enough to support a true "big training run." Rubin may be
👁 Image
00:00
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 16
With Fable/Mythos getting banned, Anthropic can just take a page out of the Inspur/Aivres playbook and change their name to avoid US government sanctions! On June 8th the DoD published its updated Section 1260H list of Chinese military companies operating in the United States.
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 16
Replying to @SemiAnalysis_
Don't take our word that Aivres == Inspur. Attached is source from the Inspur (000977) 2020 prospectus. Translated to English: Shandong SASAC --> Inspur Group (the entity on the 1260H list) --> Inspur Electronic/IEIT --> Aivres (4/5)
👁 Image
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 16
This isn't the first time. In April 2024, HPE sued Inspur Group, IEIT, and Aivres in the Northern District of California for infringing five server patents. They alleged that Inspur kept selling the accused servers in the US through its Milpitas affiliate after being
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 15
Haroon from DG Matrix stops by this week to answer the teams questions about how 800VDC is about to change the electrical infrastructure of the datacenter. @JeremieEO @JordanNanos and Nicolas Bontigui
👁 Image
00:00
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 15
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 15
China is Mogging Western Auto, and that’s Bad for Semis, National Security & War If you live anywhere outside the US, you've noticed it: the streets are filling up with cars you've never seen before. Chery? Jaecoo? Zeekr? Leapmotor? BYD? No, you didn't miss a decade of car
👁 Image
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 15
Replying to @SemiAnalysis_
China understands this perfectly. Hollowing out Western auto manufacturing isn't just a commercial strategy for auto production excellence, it erodes the West's ability to mobilize in a future conflict, while China builds the largest industrial surge capacity on earth. (9/10)
👁 Image
00:00
👁 user avatar
SemiAnalysis
@SemiAnalysis_
Jun 15
There are no perfect solutions to this growing phenomenon, but it’s important to be aware of it’s existence and of it’s potential implications. For more supply-chain intelligence – check out ChipBook (10/10)
👁 SemiAnalysis ChipBook
SemiAnalysis ChipBook
From semianalysis.com

URL: https://x.com/SemiAnalysis_

⇱ SemiAnalysis (@SemiAnalysis_) / X