Combining LLMs Rarely Beats the Single Best Model ๐ฒ beta=P(all wrong): the co-failure ceiling on LLM ensembles
The Physical AI Inference Gap in Batch-1 LLM Decode ๐ช Interactive companion to the batch-1 LLM decode paper