Meta’s star AI scientist Yann LeCun plans to leave for own startup

Avid Amoeba@lemmy.ca · 4 months ago

Avid Amoeba@lemmy.ca · edit-2 4 months ago

Also he thinks LLMs are a dead end for getting smarter AI while Zuck is doubling down on them.

UnderpantsWeevil@lemmy.world · 4 months ago

Well, he’s got a bowtie and Zuck wears an oversized t-shirt with Bugs Bunny dressed as a 90s rapper.

They certainly can’t both be wrong, can they?

XLE@piefed.social · 4 months ago

They could both be right… From a certain point of view.

Within FAIR, LeCun has instead focused on developing world models that can truly plan and reason. Over the past year, though, Meta’s AI research groups have seen growing tension and mass layoffs as Zuckerberg has shifted the company’s AI strategy away from long-term research and toward the rapid deployment of commercial products.

LeCun says current AI models are a dead end for progress. I think he’s correct.

Zuckerberg appears to believe long term development of alternative models will be a bigger money drain than pushing current ones. I think he’s correct too.

It looks like two guys arguing about which dead end to pursue.

UnderpantsWeevil@lemmy.world · edit-2 4 months ago

LeCun says current AI models are a dead end for progress.

Sure. That’s easily proven, as the Pacific Rim tech companies are all running laps around the American models in terms of efficiency and output.

It looks like two guys arguing about which dead end to pursue

They’re both Snipe Hunting for the mythological AGI, because they’re each invested in the idea of a Singularity solving all their problems.

LLMs have a set of niche useful applications, but these dudes are chucking that advancement aside in pursuit of Digital God.

With Roko’s Basilisk bumping around in their heads, I can’t help but detect a certain religious fervor, either. We really might have folks who believe they’ll be tortured for eternity if they don’t build AI Hellraiser first.

tomiant@piefed.social · edit-2 4 months ago

Getting Smarter AI < Making More Money

Is there more money in smarter AI or in manipulating people’s voting patterns with the tools you’ve got?

I saw Suck at Trump’s inauguration, I didn’t see this Chinese feller there.

nymnympseudonym@piefed.social · 4 months ago

this Chinese feller

He’s French, actually.

This is one of the three people that basically invented Deep Learning. One of the others is Geoffrey Hinton, who got the Nobel Prize in 2024.

No matter what you think of LeCun or his opinions… he’s damn well worth listening to with attention and respect.

tomiant@piefed.social · 4 months ago

Well I’ve been a damn drunk fool, then, sir.

krooklochurm@lemmy.ca · 4 months ago

Good for you for owning up to it like a grown up. Might I suggest rewarding yourself by shumming?

tomiant@piefed.social · 3 months ago

Chumming is a Quest in Escape from Tarkov. Must be level 24 to start this quest. Stash 3 Golden neck chains under the mattress next to BTR-82A in Generic Store on Interchange Stash 3 Golden neck chains in the microwave on the 3rd floor of the dorm on Customs Stash 3 Golden neck chains in the middle wooden cabin at the sawmill on Woods Eliminate 5 PMC operatives in the time period of 22:00-10: …

Mmmno.

krooklochurm@lemmy.ca · edit-2 3 months ago

Shumming.

It’s a new word I learned the other day on lemmy.

It’s when you shit and cum at the same time.

Here it is in a sentence: “I’m going to shum all over your face”

Or “I can’t right now, I’m shumming!”

tomiant@piefed.social · edit-2 3 months ago

It’s a new word I learned the other day on lemmy.

I wish you hadn’t.

technocrit@lemmy.dbzer0.com · edit-2 3 months ago

Nah. Nobody “invented” “deep learning”. Assuming there’s some progress of science it’s independent of any particular privileged bro. In fact this kind of “hero” worship is extremely anti-science. Nowadays this guy is just another mediocre grifter rich bro (regardless of past research that anybody could have done with a fair opportunity).

kromem@lemmy.world · 4 months ago

He’s been wrong about it so far and really derailed Meta’s efforts.

This is almost certainly a “you can resign or we are going to fire you” kind of situation. There’s no way with the setbacks and how badly he’s been wrong on transformers over the past 2 years that he is not finally being pushed out.

tio_bira@lemmy.world · 4 months ago

Mf look like a villain from classic Who

tal@lemmy.today · edit-2 4 months ago

Meta’s chief AI scientist and Turing Award winner Yann LeCun plans to leave the company to launch his own startup focused on a different type of AI called “world models,” the Financial Times reported.

World models are hypothetical AI systems that some AI engineers expect to develop an internal “understanding” of the physical world by learning from video and spatial data rather than text alone.

Sounds reasonable.

That being said, I am willing to believe that an LLM could be part of an AGI. It might well be an efficient way to incorporate a lot of knowledge about the world. Wikipedia helps provide me with a lot of knowledge, for example, though I don’t have a direct brain link to it. It’s just that I don’t expect an AGI to be an LLM.

EDIT: Also, IIRC from past reading, Meta has separate groups aimed at near-term commercial products (and I can very much believe that there might be plenty of room for LLMs here) and aimed at advanced AI. It’s not clear to me from the article whether he just wants more focus on advanced AI or whether he disagrees with an LLM focus in their advanced AI group.

I do think that if you’re a company building a lot of parallel compute capacity now, that to make a return on that, you need to take advantage of existing or quite near-future stuff, even if it’s not AGI. Doesn’t make sense to build a lot of compute capacity, then spend fifteen years banging on research before you have something to utilize that capacity.

https://datacentremagazine.com/news/why-is-meta-investing-600bn-in-ai-data-centres

Meta reveals US$600bn plan to build AI data centres, expand energy projects and fund local programmes through 2028

So Meta probably cannot only be doing AGI work.

tomiant@piefed.social · 4 months ago

Look, AGI would require basically a human brain. LLMs are a very specific subset mimicking an (important) part of the brain: our language module. There’s more, but I got interrupted by a drunk guy who needs my attention, I’ll be back.

krooklochurm@lemmy.ca · 4 months ago

WHAT HAPPENED WITH THE DRUNK DUDE?

tomiant@piefed.social · 3 months ago

He offered me a job.

krooklochurm@lemmy.ca · 3 months ago

Did you accept?

tomiant@piefed.social · 3 months ago

From a drunk bum? Hell no. I can stay unemployed and drunk better than he can. Why would I work for him?

krooklochurm@lemmy.ca · 3 months ago

Fair play. Sounds like he could use a hand though. Maybe you can teach him how to be better at being drunk and unemployed.

tomiant@piefed.social · 3 months ago

That’s gonna cost 'ya!

just_another_person@lemmy.world · 4 months ago

LLMs are just fast sorting and probability, they have no way to ever develop novel ideas or comprehension.

The system he’s talking about is more about using NNL, which builds new relationships to things that persist. It’s deferential relationship learning and data path building. Doesn’t exist yet, so if he has some ideas, it may be interesting. Also more likely to be the thing that kills all humans.

Communist@lemmy.frozeninferno.xyz · 4 months ago

https://blog.google/technology/ai/google-gemma-ai-cancer-therapy-discovery/ how did it do this?

just_another_person@lemmy.world · edit-2 4 months ago

Lol 🤣 I’m SO EMBARRASSED. You’re totally right and understand these things better than me after reading a GOOGLE BLOG ABOUT THEIR PRODUCT.

I’ll never speak to this topic again since I’ve clearly been bested with your knowledge from a Google Blog.

Communist@lemmy.frozeninferno.xyz · edit-2 4 months ago

yes, google reported about their ai discovering a novel cancer treatment, of course they did?

now tell me about how it isn’t true. Do you have anything of substance to discredit this?

this reeks of confirmation bias, did you even try to invalidate your preconceived notions?

just_another_person@lemmy.world · edit-2 4 months ago

I sure do. Knowledge, and being in the space for a decade.

Here’s a fun one: go ask your LLM why it can’t create novel ideas, it’ll tell you right away 🤣🤣🤣🤣

LLMs have ZERO intentional logic that allows them to even comprehend an idea, let alone craft a new one and create relationships between others.

I can already tell from your tone you’re mostly driven by bullshit PR hype from people like Sam Altman, and are an “AI” fanboy, so I won’t waste my time arguing with you. You’re in love with human-made logic loops and datasets, bruh. There is not now, nor was there ever, a way for any of it to become some supreme being of ideas and knowledge as you’ve been pitched. It’s super fast sorting from static data. That’s it.

You’re drunk on Kool-Aid, kiddo.

Communist@lemmy.frozeninferno.xyz · edit-2 4 months ago

You sound drunk on kool-aid, this is a validated scientific report from yale, tell me a problem with the methodology or anything of substance.

so what if that’s how it works? It clearly is capable of novel things.

just_another_person@lemmy.world · edit-2 4 months ago

🤦🤦🤦 No…it really isn’t:

Teams at Yale are now exploring the mechanism uncovered here and testing additional AI-generated predictions in other immune contexts.

Not only is there no validation, they have only begun even looking at it.

Again: LLMs can’t make novel ideas. This is PR, and because you’re unfamiliar with how any of it works, you assume MAGIC.

Like every other bullshit PR release of its kind, this is simply a model being fed a ton of data and running through thousands of millions of iterative segments testing outcomes of various combinations of things that would take humans years to do. It’s not that it is intelligent or making “discoveries”, it’s just moving really fast.

You feed it 10² combinations of amino acids, and it’s eventually going to find new chains needed for protein folding. The thing you’re missing there is:

All the logic programmed by humans
The data collected and sanitized by humans
The task groups set by humans
The output validated by humans

It’s a tool for moving fast through data, a.k.a. A REALLY FAST SORTING MECHANISM

Nothing at any stage of development is novel output, or validated by any models, because…they can’t do that.

Eheran@lemmy.world · 3 months ago

Wow, you stayed way cooler than I would have. Lemmy is extremely anti-LLM or AI in general.

technocrit@lemmy.dbzer0.com · 3 months ago

this is a validated scientific report from yale

Oof. Tell me you don’t understand science without telling me you don’t understand science.

markon@lemmy.world · 3 months ago

A decade in the space is impressive. It shows dedication and time invested. That alone deserves recognition.

Still, the points you are repeating are familiar. They are recycled claims from years ago. If the goal is to critique novelty, repeating the same arguments does not advance it.

You say LLMs have zero intentional logic. That is true if by intentional logic you mean human consciousness or goals. It is false if you mean emergent behaviors and the ability to combine information in ways no single source explicitly wrote. Eliminating nuance with absolute terms makes it easy to dismiss valid evidence.

Calling someone an AI fanboy signals preference for labels over analysis. That approach does not strengthen an argument. Specific examples do. Concrete failures, reproducible tests, or papers are what advance discussion.

It is also not accurate to suggest that anyone pitches LLMs as supreme beings. Most people treat them as complex tools that produce surprising results. Their speed, scale, and capacity to identify patterns exceed human ability, but they remain tools. Critiquing them as if they were gods is a strawman.

If you want this discussion to matter, show a single reproducible example where an LLM fails in a way your logic cannot explain. Otherwise, repeating slogans and metaphors only illustrates a resistance to evidence.

I am not here to argue for ideology. I am here to examine claims. That is a choice. It is also a choice to resist slogans and demand specificity. Fun, fun. Another fun day.

technocrit@lemmy.dbzer0.com · 3 months ago

Wow a corporate press release? The peak of science!!! jfc.

Communist@lemmy.frozeninferno.xyz · edit-2 3 months ago

It doesn’t have to be to invalidate the claim. It proposed a novel hypothesis, this is the easiest thing to check in the world.

since I don’t have to rely on google it really doesn’t have to even be a decent source.

nymnympseudonym@piefed.social · 4 months ago

LLMs are just fast sorting and probability, they have no way to ever develop novel ideas or comprehension

And how do you think animal brains develop comprehension…?

just_another_person@lemmy.world · 4 months ago

Animal brains have pliable neuron networks and synapses to build and persist new relationships between things. LLMs do not. This is why they can’t have novel or spontaneous ideation. They don’t “learn” anything, no matter what Sam Altman is pitching you.

Now…if someone develops this ability, then they might be able to move more towards that…which is the point of this article and why the guy is leaving to start his own project doing this thing.

So you sort of sarcastically answered your own stupid question 🤌

nymnympseudonym@piefed.social · 4 months ago

Animal brains have pliable neuron networks and synapses to build and persist new relationships between things. LLMs do not. This is why they can’t have novel or spontaneous ideation

This Nobel prize winner seems to disagree with you.

Neural nets do indeed learn new relationships. Maybe you are thinking of the fact that most architectures require training to be a separate process from interacting; that is not the case for all architectures.

just_another_person@lemmy.world · 4 months ago

From your own linked paper:

To design a neural long-term memory module, we need a model that can encode the abstraction of the past history into its parameters. An example of this can be LLMs that are shown to be memorizing their training data [98, 96, 61]. Therefore, a simple idea is to train a neural network and expect it to memorize its training data. Memorization, however, has almost always been known as an undesirable phenomena in neural networks as it limits the model generalization [7], causes privacy concerns [98], and so results in poor performance at test time. Moreover, the memorization of the training data might not be helpful at test time, in which the data might be out-of-distribution. We argue that, we need an online meta-model that learns how to memorize/forget the data at test time. In this setup, the model is learning a function that is capable of memorization, but it is not overfitting to the training data, resulting in a better generalization at test time.

Literally what I just said. This is specifically addressing the problem I mentioned, and it goes on in exacting detail about why it does not exist in production tools for the general public (it’ll never make money, and it’s slow, honestly). In fact, there is a minor argument later on that developing a separate supporting system negates even referring to the outcome as an LLM, and the supporting papers referenced at the bottom dig even deeper into the exact thing I mentioned on the limitations of said models used in this way.

UnderpantsWeevil@lemmy.world · 4 months ago

Sounds reasonable.

Does it, though? Feels like we’re just rewriting the sales manual without thinking about what “learning from video” would actually entail.

Doesn’t make sense to build a lot of compute capacity, then spend fifteen years banging on research before you have something to utilize that capacity.

There’s an old book from back in 2008 - Killing Sacred Cows: Overcoming the Financial Myths That Are Destroying Your Prosperity - that a lot of the modern Techbros took perhaps too closely to heart. It posited that chasing the next generation of technological advancement was more important than keeping your existing revenue streams functional. And you really should kill the golden goose if it means you’ve got a shot at a new one in the near future.

What these Tech Companies are chasing is the Next Big Thing, even when they don’t really understand what that is. And they’re so blindly devoted to advancing the technological curve that they really will blow a trillion dollars (mostly of other people’s money) on whatever it is they think that might be.

The real problem is that these guys are, largely, uncreative and incurious and not particularly intelligent. So they leap on fads rather than pursuing meaningful Blue Sky Research. And that gives us this endless recycling of Sci-Fi tropes as a stand in for material investments in productive next generation infrastructure.

Avid Amoeba@lemmy.ca · 4 months ago

I saw a short interview with him by France 24 and he mainly said he thinks the current direction of the research teams at Meta is wrong. He made a contrast between a top-down, push-to-deliver org as opposed to a long-leash one that leaves the researchers to experiment with things. He said Meta shifted from the latter to the former and he doesn’t agree with the approach.

chrash0@lemmy.world · 4 months ago

he’s been salty about this for years now and frustrated at companies throwing training and compute scaling at LLMs hoping for another emergent breakthrough like GPT-3. i believe he’s the one that really tried to push the Llama models toward multimodality

MonkderVierte@lemmy.zip · edit-2 4 months ago

I don’t want him as a boss even if i were fully into AI.

violentfart@lemmy.world · 4 months ago

I mean come on, you can tell he’s been looking around for something else.

youmaynotknow@lemmy.zip · 4 months ago

Yay, another BS AI slop data grabbing AI company, because we can’t have enough of that shit.

technocrit@lemmy.dbzer0.com · 3 months ago

Grifters gonna grift each other.