Audi stopped Q8 e-tron production in early 2025. I don't know how much allocation the US has had of the semi-replacement (S)Q6, and A6 was not launched at all.
Q4 is a bit weird, since it's just a more expensive ID.4, and not exactly more premium. It actually has a less premium feel than its sister car, the Skoda Enyaq, but that's not available in the US.
They're a bit out-of-phase with BMW and Mercedes right now, who just opened the books on their new platform cars. Perhaps you could argue it was bad timing with the Q6 being a bit of an "inbetweener", but the PPE platform was delayed, to be fair.
The US market is extremely regressive due to the changing regulatory environment. I fully expect new ICE cars without catalytic converters in the near future.
This is not representative of the rest of the world.
They might be happy that they can keep making V8s, but they have to know any future administration could easily outlaw any design that goes too far backwards. Such a car will also not be able to be sold anywhere else in the world. Heck, by the time they design, tool, and produce such a beast it could already be too late.
The demand for EVs is crashing across the board. Porsche for example is now in dire straits because they had promised to make the 718 only as EV and with demand going down, they'll revamp the platform and get ICE 718s back.
Unless the feds can take California out of the regulatory picture, I don't expect major steps backward. Almost half the country adopts California's emissions standards.
DRAM speed is one thing, but you should also account for the data rate of the PCIe bus (and/or VRAM speed). But yes, holding it "lukewarm" in DRAM rather than on NVMe storage is obviously faster.
Four channels of DDR4-3200 vs two channels of DDR5-6400 (four subchannels) should come out pretty close. I don't see any reason why the DDR4 configuration would be consistently faster; you might have more bank groups on DDR4, but I'm not sure that would outweigh other factors like the topology and bandwidth of the interconnects between the memory controller and the CPU cores.
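As a back-of-the-envelope check on "pretty close", here is a minimal sketch of the theoretical peak numbers. It assumes 64 data bits per channel (for DDR5, that is two 32-bit subchannels per channel) and ignores ECC bits, refresh, and controller efficiency; the function name is made up for illustration.

```python
def peak_bandwidth_gbs(channels: int, mega_transfers_per_s: int,
                       bus_width_bits: int = 64) -> float:
    """Theoretical peak bandwidth in GB/s (decimal), not a measured figure."""
    bytes_per_transfer = bus_width_bits // 8
    # channels * MT/s * bytes per transfer gives MB/s; divide by 1000 for GB/s
    return channels * mega_transfers_per_s * bytes_per_transfer / 1000


ddr4_quad = peak_bandwidth_gbs(channels=4, mega_transfers_per_s=3200)
ddr5_dual = peak_bandwidth_gbs(channels=2, mega_transfers_per_s=6400)

print(f"4x DDR4-3200: {ddr4_quad} GB/s")
print(f"2x DDR5-6400: {ddr5_dual} GB/s")
```

On paper the two come out identical, so any real-world difference would have to come from things like bank groups, latency, or the interconnect, not from raw channel bandwidth.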
Llama 3.1, however, is not MoE, so all params are active.
For MoE it is tricky, because for each token you only use a subset of params (an “expert”) but you don’t know which one, so you have to keep them all in memory or wait until it loads from slower storage, potentially different for each token.
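To put rough numbers on the dense vs. MoE difference, here is a sketch that assumes token generation is purely memory-bandwidth bound, i.e. every active weight is read once per token. The 100 GB/s bandwidth, the 1 byte per param (8-bit quantization), and the MoE sizes (100B total, 10B active) are hypothetical figures picked for illustration, not any specific model or machine.

```python
def max_tokens_per_s(active_params_billion: float, bytes_per_param: float,
                     bandwidth_gbs: float) -> float:
    """Upper bound on tokens/s if each active weight is read once per token."""
    gb_read_per_token = active_params_billion * bytes_per_param
    return bandwidth_gbs / gb_read_per_token


def footprint_gb(total_params_billion: float, bytes_per_param: float) -> float:
    """Memory needed to keep ALL weights resident, active or not."""
    return total_params_billion * bytes_per_param


BANDWIDTH_GBS = 100.0   # assumed memory bandwidth
BYTES_PER_PARAM = 1.0   # assumed 8-bit weights

# Dense 70B model: all params are active for every token.
dense_tps = max_tokens_per_s(70, BYTES_PER_PARAM, BANDWIDTH_GBS)
dense_mem = footprint_gb(70, BYTES_PER_PARAM)

# Hypothetical MoE: 100B total params, 10B active per token.
moe_tps = max_tokens_per_s(10, BYTES_PER_PARAM, BANDWIDTH_GBS)
moe_mem = footprint_gb(100, BYTES_PER_PARAM)

print(f"dense: <= {dense_tps:.2f} tok/s, {dense_mem:.0f} GB resident")
print(f"MoE:   <= {moe_tps:.2f} tok/s, {moe_mem:.0f} GB resident")
```

The MoE case reads far less per token, but the full set of weights still has to sit somewhere fast, which is the tricky part: if the needed experts aren't in memory, you pay the slower storage's bandwidth instead.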
> Rebecca Beth Bauer-Kahan (née Bauer; born October 28, 1978) is an American attorney and politician who has served as a member of the California State Assembly from the 16th district since 2018. A member of the Democratic Party, her district extends from Lamorinda to the Tri-Valley region of the San Francisco Bay Area. She has been described as a women's rights advocate.
It sets the price floor and provides liquidity, so the phone doesn’t go into a trash bin instead.