definitely the last thing anybody who’s used windows would associate with the system
definitely the last thing anybody who’s used windows would associate with the system


removing the interactions bit is really impressive, the model effectively acts as a physics engine figuring out how objects in the scene interact with one another


Imagine believing that countries ruled by literal pedophiles, who are completely unaccountable, have a more fair system than China or DPRK 🤡
Iranian president wrote an open letter to Americans https://time.com/article/2026/04/01/iran-war-president-open-letter-american-people/


And that’s precisely why I support Chinese system.


only in the court of public opinion


Binary quantization and 1 bit vectors have definitely been floating around the space for years. The big difference here is not necessarily just better raw precision but how they completely eliminate the hidden memory tax that usually comes with extreme compression. Normally when you crush a 32 bit float down to a single bit you destroy a massive amount of scale and range information. To make the model actually usable after that traditional methods usually have to store extra full precision numbers alongside those compressed blocks to act as scaling factors or zero points. So your theoretical 1 bit compression actually ends up costing something like 2 or 3 bits per parameter in practice.
TurboQuant gets around this by using the Quantized Johnson Lindenstrauss transform which is basically a mathematical guarantee that the relative distances between different data points will be preserved even when the data is aggressively shrunk. By doing this and dropping everything to just a positive or negative sign bit they completely remove the need to store any full precision scaling factors. It literally has zero memory overhead. To make sure the attention mechanism still works they use a special estimator that takes a high precision query and runs it against that low precision 1 bit cache in a way that mathematically eliminates bias.
You also have to look at how they are actually applying it in the pipeline. They don’t just take the raw 32 bit vector and smash it down to 1 bit right out of the gate. They use that PolarQuant method first to map everything to polar coordinates and capture the main structure and strength of the vector. The 1 bit QJL algorithm is only deployed at the very end as a targeted cleanup to fix residual errors left over from the first step.


I looked at this recently, and it really doesn’t look like any of them work as a daily driver. I ended up going with GrapheneOS in the end.
huh?
People in China enjoy genuine human rights, like right to housing, education, and healthcare. 90% of families in the country own their home giving China one of the highest home ownership rates in the world. What’s more is that 80% of these homes are owned outright, without mortgages or any other leans. https://www.forbes.com/sites/wadeshepard/2016/03/30/how-people-in-china-afford-their-outrageously-expensive-homes
The real (inflation-adjusted) incomes of the poorest half of the Chinese population increased by more than four hundred percent from 1978 to 2015, while real incomes of the poorest half of the US population actually declined during the same time period. https://www.nber.org/system/files/working_papers/w23119/w23119.pdf
From 1978 to 2000, the number of people in China living on under $1/day fell by 300 million, reversing a global trend of rising poverty that had lasted half a century (i.e. if China were excluded, the world’s total poverty population would have risen) https://www.semanticscholar.org/paper/China’s-Economic-Growth-and-Poverty-Reduction-Angang-Linlin/c883fc7496aa1b920b05dc2546b880f54b9c77a4
In fact, people in China enjoy high levels of social mobility in general https://www.nytimes.com/interactive/2018/11/18/world/asia/china-social-mobility.html
Student debt in China is virtually non-existent because education is not run for profit. https://www.forbes.com/sites/jlim/2016/08/29/why-china-doesnt-have-a-student-debt-problem/
China massively invests in public infrastructure. They used more concrete in 3 years than US in all of 20th century https://www.forbes.com/sites/niallmccarthy/2014/12/05/china-used-more-concrete-in-3-years-than-the-u-s-used-in-the-entire-20th-century-infographic/
China also built 27,000km of high speed rail in a decade https://www.railjournal.com/passenger/high-speed/ten-years-27000km-china-celebrates-a-decade-of-high-speed/
All these things translate into tangible freedoms allowing people to live their lives to the fullest. Freedom can be seen as the measure of personal agency an individual enjoys within the framework of society. A good measure of whether people genuinely feel free is to look at what people of the country have to say on the subject. Even as mainstream western media openly admits, people in China overwhelmingly see their system as being democratic, and the government enjoys broad public trust and support.
same, any time I look for uplifting news, it’s inevitably from China


oops fixed


just wait for the bubble to pop, and I’m sure we’ll see a lot of affordable GPUs flood the market from the abandoned data centres :)


That’s part of the idea with the whole mixture of experts (MoE) approach in newer models actually.
Rather than using a single neural net that’s say 512 wide, you split it into eight channels/experts of 64. If the neural net can pick the correct channel for each inference, then you only have to run 1/8th of the neurons on every forward pass. Of course, once you have your 8 channels/experts in parallel, you now need to decide which expert/channel to use for each token you want to process. This is called a router which takes in an input and decides which expert/channel to send it to. The router itself is a tiny neural network. It is a matrix that converts the input vectors to a router choice. And the router itself has a small set of trainable weights that gets trained together with the MoE.


Oh I see we have a commenter from a genocide supporting instance. So brave.


It’s true, the numbers for the US regime are likely grossly undercounted just as they were for COVID https://gizmodo.com/updated-death-toll-shows-the-us-grossly-undercounted-covid-deaths-early-on-2000734587


The trick they use is pretty clever. When you ask an AI to write code, it doesn’t always get it right. Sometimes the code has bugs, sometimes it misunderstands the problem entirely. A naive way to address that is to generate a few solutions and test each one. The odds that at least one works go way up. ATLAS generates multiple attempts, running each through a test suite. Each retry also gets told what went wrong with the previous attempt, so it can try to avoid the same mistake.
But this can be pretty slow since you have to run the code in an isolated environment, check the outputs, wait for it to finish. Doing that for every candidate quickly adds up. So ATLAS has another shortcut for avoiding unnecessary testing. Instead of simply generating solutions and testing all of them, it tries to predict which one is most likely correct before running any tests.
ATLAS also asks the model for an embedding of what it just wrote which acts as a fingerprint. Two similar pieces of code will produce similar fingerprints. A well-written, confident solution will produce a different fingerprint than a confused, buggy one.
These fingerprints get fed into a separate, much smaller neural network called the Cost Field. This little network was trained ahead of time on examples where they already knew which solutions were correct and which were wrong. It learned to assign a score to each fingerprint. Correct solutions get a low score and incorrect ones get a high one.
So the process is to generate multiple solutions, get their fingerprints, score each one, and pick the lowest. Only that one gets tested. The Cost Field picks correctly about 88% of the time according to the repo.
Oh yeah corps will absolutely do that. We can kinda see the same thing happening with everything moving to streaming services too.
Personally, I can’t recall any time in history when Windows was genuinely robust. The NT system was probably the least worst, but it had plenty of problems as well. I completely agree that things got even worse under Nadella. Seems like MS is a complete clown show today.