Interconnects @interconnectsai Twitter profile

Pinned Tweet

Interconnects

10 months

If you’re a student and want to read paid posts, contact @natolambert by email or DM. Happy to provide a base 80%+ discount.

2

1

6

Last Seen Profiles

@Marlee92246247

@FemsubCami

@PlatinumKey13

@CAPTAlNPUGH

@ANABROWNHORAN1

@AndyMann2017

@ladyToni_Remer

@clintonjeff

@alicekeeler

@MirzaRe05924314

@TuruGlobal

@DanielJiang0603

@TonyDiprose8

@PupPastel

@FisonSienn75971

@SayurLodehBTW2

@brucegelin

@DexBeta

@ProwlerAwards

@Lulo_Benitez

@Ben10MMA

@MT_WBB

@shroomojis

@hsgwrn29

@ShadowMann9

@agillingham18

@ivvan_ln

@StevenW82850819

@MartinaMon40637

@Luisalb00042633

@Faydherberider

@ABIN9999999

@Sad0Bone

@BlandiOficial

@fantasyfber

@gavinnaquin14

Interconnects

@interconnectsai

6 months

Synthetic data: Anthropic’s CAI, from fine-tuning to pretraining, OpenAI’s Superalignment, tips, types, and open examples Synthetic data is the accelerator of the next phase of AI — what it is and what it means.

Synthetic data: Anthropic’s CAI, scaling, OpenAI’s Superalignment, tips, and open-source examples

Synthetic data is the accelerator of the next phase of AI — what it is and what it means.

www.interconnects.ai

2

24

124

Interconnects

@interconnectsai

4 months

Model merging lessons in The Waifu Research Department When what seems like pure LLM black magic is actually supported by the literature.

Model merging lessons in The Waifu Research Department

When what seems like pure LLM black magic is actually supported by the literature.

www.interconnects.ai

2

12

68

Interconnects

@interconnectsai

2 months

DBRX: The new best open model and Databricks’ ML strategy Databricks’ new model is surpassing the performance of Mixtral and Llama 2 70B while still being in a size category that's reasonably accessible.

DBRX: The new best open model and Databricks’ ML strategy

Databricks’ new model is surpassing the performance of Mixtral and Llama 2 70B while still being in a size category that's reasonably accessible.

www.interconnects.ai

2

12

59

Interconnects

@interconnectsai

5 months

State-space LLMs: Do we need Attention? Mamba, StripedHyena, Based, research overload, and the exciting future of many LLM architectures all at once.

State-space LLMs: Do we need Attention?

Mamba, StripedHyena, Based, research overload, and the exciting future of many LLM architectures all at once.

www.interconnects.ai

0

6

24

Interconnects

@interconnectsai

6 months

RLHF progress: Scaling DPO to 70B, DPO vs PPO update, Tülu 2, Zephyr-β, meaningful evaluation, data contamination Huge steps forward in confirming that RLHF can really help you on vibes based evaluation, among many other RLHF analyses.

RLHF progress: Scaling DPO to 70B, DPO vs PPO update, Tülu 2, Zephyr-β, meaningful evaluation, data...

Huge steps forward in confirming that RLHF can really help you on vibes based evaluation, among many other RLHF analyses.

www.interconnects.ai

1

2

34

Interconnects

@interconnectsai

5 months

Big Tech's LLM evals are just marketing A PSA everyone needs. The importance of a wait and see attitude when it comes to new models, big and small, open and closed.

Big Tech's LLM evals are just marketing

A PSA everyone needs. The importance of a wait and see attitude when it comes to new models, big and small, open and closed.

www.interconnects.ai

1

4

7

Interconnects

@interconnectsai

7 months

RLHF lit. review #1 and missing pieces in RLHF: Looking at the difference between two sets -- what rumors say industry leaders are doing with RLHF and what the literature is up to. A new series studying RLHF literature.

RLHF lit. review #1 and missing pieces in RLHF

Looking at the difference between two sets -- what rumors say industry leaders are doing with RLHF and what the literature is up to. I'm starting my new series studying RLHF literature.

www.interconnects.ai

1

8

29

Interconnects

@interconnectsai

5 months

Interviewing Tri Dao and Michael Poli of Together AI on the future of LLM architectures The first Interconnects research interview! We go even further on the promise of state-space models in the emerging LLM market.

Interviewing Tri Dao and Michael Poli on the future of LLM architectures

Listen now | The first Interconnects research interview! We go even further on the promise of state-space models in the emerging LLM market.

www.interconnects.ai

0

3

22

Interconnects

@interconnectsai

1 month

Llama 3: Scaling open LLMs to AGI Meta shows that scaling won't be a limit for open LLM players in the near future.

Llama 3: Scaling open LLMs to AGI

Llama 3 shows that scaling won't be a limit for open LLM progress in the near future.

www.interconnects.ai

2

4

21

Interconnects

@interconnectsai

7 months

Undoing RLHF and the brittleness of safe LLMs Recent papers show most of the arguments about needing "safety" in releases of open LLM weights are nearly dead in the water. Yes, still release the parameters. Read here:

Undoing RLHF and the brittleness of safe LLMs

Most of the arguments about "safe" releases of open LLM weights are nearly dead in the water.

www.interconnects.ai

0

7

20

Interconnects

@interconnectsai

12 days

ChatBotArena: The peoples’ LLM evaluation, the future of evaluation, the incentives of evaluation, and gpt2chatbot What the details tell us about the most in-vogue LLM evaluation tool — and the rest of the field.

ChatBotArena: The peoples’ LLM evaluation, the future of evaluation, the incentives of evaluation,...

What the details tell us about the most in-vogue LLM evaluation tool — and the rest of the field.

www.interconnects.ai

0

2

18

Interconnects

@interconnectsai

1 month

The end of the “best open LLM” Modeling the compute versus performance tradeoff of many open LLMs.

The end of the “best open LLM”

Modeling the compute versus performance tradeoff of many open LLMs.

www.interconnects.ai

1

2

18

Interconnects

@interconnectsai

10 days

OpenAI’s Model (behavior) Spec, RLHF transparency, personalization questions Now we will have some grounding for when weird ChatGPT behaviors are intended or side-effects — shrinking the Overton window of RLHF bugs.

OpenAI’s Model (behavior) Spec, RLHF transparency, and personalization

Now we will have some grounding for when weird ChatGPT behaviors are intended or side-effects — shrinking the Overton window of RLHF bugs.

www.interconnects.ai

5

11

16

Interconnects

@interconnectsai

4 months

Multimodal LM roundup: Unified IO 2, inputs and outputs, Gemini, LLaVA-RLHF, and RLHF questions A sampling of recent happenings in the multimodal space. Be sure to expect more this year.

Multimodal LM roundup: Unified IO 2, inputs and outputs, Gemini, LLaVA-RLHF, and RLHF questions

A sampling of recent happenings in the multimodal space. Be sure to expect more this year.

www.interconnects.ai

0

3

16

Interconnects

@interconnectsai

3 months

How to cultivate a high-signal AI feed Basic tips on how to assess inbound ML content and cultivate your news feed.

How to cultivate a high-signal AI feed

Basic tips on how to assess inbound ML content and cultivate your news feed.

www.interconnects.ai

2

1

16

Interconnects

@interconnectsai

21 days

Phi 3 and Arctic: Outlier LMs are hints Models that seem totally out of scope from recent open LLMs give us a sneak peek of where the industry will be in 6 to 18 months.

Phi 3 and Arctic: Outlier LMs are hints

Models that seem totally out of scope from recent open LLMs give us a sneak peek of where the industry will be in 6 to 18 months.

www.interconnects.ai

0

3

16

Interconnects

@interconnectsai

7 months

How the Foundation Model Transparency Index Distorts Transparency, by @natolambert SE Gyges @BlancheMinerva @aviskowron (Cross post with @AiEleuther )

How the Foundation Model Transparency Index Distorts Transparency

A proper critique of the Foundation Model Transparency Index (FMTI). Plus some thoughts on the ecosystem implications.

www.interconnects.ai

0

3

16

Interconnects

@interconnectsai

2 months

Model commoditization and product moats Where moats are tested now that so many people have trained GPT4 class models. Claude 3, Gemini 1.5, Inflection 2.5, and Mistral Large are here to party.

Model commoditization and product moats

Where moats are tested now that so many people have trained GPT4 class models. Claude 3, Gemini 1.5, Inflection 2.5, and Mistral Large are here to party.

www.interconnects.ai

0

2

16

Interconnects

@interconnectsai

6 months

The DPO debate: Do we need RL for RLHF? Direct vs. RL methods for preferences, more RLHF models, and hard truths in open RLHF work. We have more questions than answers.

Do we need RL for RLHF?

Direct (DPO) vs. RL methods for preferences, more RLHF models, and hard truths in open RLHF work. We have more questions than answers.

www.interconnects.ai

2

3

12

Interconnects

@interconnectsai

2 months

Evaluations: Trust, performance, and price (bonus, announcing RewardBench) Evaluation is not only getting harder with modern LLMs getting more complicated, it’s getting harder because it means something different.

Evaluations: Trust, performance, and price (bonus, announcing RewardBench)

Evaluation is not only getting harder with modern LLMs, it’s getting harder because it means something different.

www.interconnects.ai

0

3

15

Interconnects

@interconnectsai

5 months

Mixtral Round-up: MoE trade-offs, release lessons, Mistral raises $400mil, Google's loss, vibes vs marketing Emergency blog 🚨 We have an amazing open mixture of experts model for the holidays!

Mixtral: The best open model, MoE trade-offs, release lessons, Mistral raises $400mil, Google's...

We have an amazing open mixture of experts model for the holidays!

www.interconnects.ai

1

3

13

Interconnects

@interconnectsai

1 month

We don’t need to reinvent everything to solve alignment Integrating some non-computing science into reinforcement learning from human feedback (RLHF) can give us the models we want. Bonus: OLMo 1.7-7B.

Stop "reinventing" everything to solve alignment

Integrating some non-computing science into reinforcement learning from human feedback (RLHF) can give us the models we want.

www.interconnects.ai

1

2

12

Interconnects

@interconnectsai

5 months

It's 2024 and they just want to learn The state of the ML communities big and small starting 2024. My general expectations for the year.

It's 2024 and they just want to learn

The state of the ML communities big and small starting 2024. My general expectations for the year.

www.interconnects.ai

0

3

12

Interconnects

@interconnectsai

7 months

The AI research job market shit show (and my experience) There are plenty of jobs, but finding a place where you're happy is as hard as ever. Read here:

The AI research job market shit show (and my experience)

There are plenty of jobs, but finding a place where you're happy is as hard as ever.

www.interconnects.ai

0

2

11

Interconnects

@interconnectsai

10 months

Specifying objectives in RLHF: the links between the scientific weirdness of RLHF, DPO, and @johnschulman2 's ICML talk on proxy objectives.

Specifying objectives in RLHF

At ICML, it is obvious that many people are getting value out of RLHF. What is limiting the scientific understanding of it (other than research embargoes)?

www.interconnects.ai

0

2

11

Interconnects

@interconnectsai

4 months

Open Language Models (OLMos) and the LLM landscape A small model at the beginning of big changes.

Open Language Models (OLMos) and the LLM landscape

A small model at the beginning of big changes.

www.interconnects.ai

0

9

Interconnects

@interconnectsai

3 months

OpenAI’s Sora for video, Gemini 1.5's infinite context, and a secret Mistral model 🚨 Emergency blog! Three things you need to know from the ML world that arrived on Thursday.

OpenAI’s Sora for video, Gemini 1.5's infinite context, and a secret Mistral model

Emergency blog! Three things you need to know from the ML world that arrived yesterday.

www.interconnects.ai

0

3

7

Interconnects

@interconnectsai

7 months

Open LLM company playbook Where does releasing model weights fit into company strategy? 3 requirements, 3 actions, and 3 benefits of being in the open LLM space.

Open LLM company playbook

Where does releasing model weights fit into company strategy? 3 requirements, 3 actions, and 3 benefits of being in the open LLM space.

www.interconnects.ai

0

5

9

Interconnects

@interconnectsai

27 days

AGI is what you want it to be Certain definitions of AGI are backing people into a pseudo-religious corner.

AGI is what you want it to be

Certain definitions of AGI are backing people into a pseudo-religious corner.

www.interconnects.ai

0

2

8

Interconnects

@interconnectsai

6 months

OpenAI’s shakeup and the left turn in the narrative New timelines that emerge in AI and the winners and losers, regardless of the unfolding details.

0

1

7

Interconnects

@interconnectsai

6 months

The Q* hypothesis Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data 🚨 Emergency special: The information we need to understand what Q* is was right in front of us, but the memes are more fun than reality.

The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic...

Emergency special: The information we need to understand what Q* is was right in front of us, but the memes are more fun than reality.

www.interconnects.ai

0

1

7

Interconnects

@interconnectsai

10 months

LLM agents follow-up: exploration, RLHF, and more: How does autonomy of language models relate to data collection. [partially $]

LLM agents follow-up: exploration, RLHF, and more

How does autonomy of language models relate to data collection.

www.interconnects.ai

0

2

6

Interconnects

@interconnectsai

6 months

The interface era of AI Modern LLMs are becoming the easiest and most efficient way to access information. This will change how we see the world.

The interface era of AI

Modern LLMs are becoming the easiest and most efficient way to access information. This will change how we see the world.

www.interconnects.ai

0

6

Interconnects

@interconnectsai

3 months

The koan of an open-source LLM A proposal for a new definition of an “open-source” LLM and why no definition will ever just work.

The koan of an open-source LLM

A proposal for a new definition of an “open-source” LLM and why no definition will ever just work.

www.interconnects.ai

0

2

6

Interconnects

@interconnectsai

3 months

Why reward models are key for alignment In an era dominated by direct preference optimization and LLM-as-a-judge, why do we still need a model to output only a scalar reward?

Why reward models are key for alignment

In an era dominated by direct preference optimization and LLM-as-a-judge, why do we still need a model to output only a scalar reward?

www.interconnects.ai

1

0

6

Interconnects

@interconnectsai

8 months

Midjourney vs. Ideogram, ML product companies, preventing AI winter, by @natolambert The coming image-generation battle and its implications on ML product longevity.

Midjourney vs. Ideogram, ML product companies, preventing AI winter, DALL·E 3 tease

The coming image-generation battle and its implications on ML product longevity.

www.interconnects.ai

0

1

5

Interconnects

@interconnectsai

8 months

DALL·E 3 and multimodality as moats, correcting bad moat takes, by @natolambert

DALL·E 3 and multimodality as moats, correcting bad moat takes

Multimodality may be a key differentiator in how moats are built for LLMs.

www.interconnects.ai

0

1

5

Interconnects

@interconnectsai

10 months

Llama 2 follow-up: too much RLHF, GPU sizing, technical details, and more

Llama 2 follow-up: too much RLHF, GPU sizing, technical details

The community reaction to Llama 2 and all of the things that I didn't get to in the first issue.

www.interconnects.ai

0

2

5

Interconnects

@interconnectsai

3 months

10 Sora and Gemini 1.5 follow-ups: code-base in context, deepfakes, pixel-peeping, inference costs, and more The cutting edge technical discussions beneath the wow factor.

10 Sora and Gemini 1.5 follow-ups: code-base in context, deepfakes, pixel-peeping, inference costs…

The cutting edge technical discussions beneath the wow factor.

www.interconnects.ai

1

2

5

Interconnects

@interconnectsai

3 months

Google ships it: Gemma open LLMs and Gemini backlash Google rejoins the open model party and gets some backlash for a frequent problem for generative AI.

Google ships it: Gemma open LLMs and Gemini backlash

Google rejoins the open model party and gets some backlash for a frequent problem for generative AI.

www.interconnects.ai

0

2

5

Interconnects

@interconnectsai

3 months

Alignment-as-a-service: Scale AI vs. the new guys Scale’s making over $750 million per year selling data for RLHF, who’s coming to take it?

Alignment-as-a-Service: Scale AI vs. the new guys

Scale’s making over $750 million per year selling data for RLHF, who’s coming to take it?

www.interconnects.ai

0

1

5

Interconnects

@interconnectsai

5 months

Where 2024’s “open GPT4” can’t match OpenAI’s And why the comparisons don't really matter. Repeated patterns in the race for reproducing ChatGPT, another year of evaluation crises, and people who will take awesome news too far.

Where 2024’s “open GPT4” can’t match OpenAI’s

And why the comparisons don't really matter. Repeated patterns in the race for reproducing ChatGPT, another year of evaluation crises, and people who will take awesome news too far.

www.interconnects.ai

0

3

5

Interconnects

@interconnectsai

8 months

Can robotics take off like GenAI? Moravec's paradox vs. scaling laws, by @natolambert . Arguments in the literature for and against rapid progress in robotic learning research.

Can robotics take off like GenAI? Moravec's paradox vs. scaling laws

Arguments for and against rapid progress in robotic learning research.

www.interconnects.ai

0

4

Interconnects

@interconnectsai

8 months

Open, general-purpose LLM companies might not be viable Failure modes on the quest to open-source LLMs (coming from someone who wants openness). Expect pivots to specialized models.

Open-source, general LLM companies might not be viable

Failure modes on the quest to general, open-source LLMs. Expect pivots to specialized models.

www.interconnects.ai

0

1

4

Interconnects

@interconnectsai

19 days

How RLHF works, part 2: A thin line between useful and lobotomized Many, many signs of life for preference fine-tuning beyond spoofing chat evaluation tools.

How RLHF works, part 2: A thin line between useful and lobotomized

Many, many signs of life for preference fine-tuning beyond spoofing chat evaluation tools.

www.interconnects.ai

0

1

4

Interconnects

@interconnectsai

4 months

Local LLMs, some facts some fiction The deployment path that’ll break through in 2024. Plus, checking in on strategies across Big Tech and AI leaders.

Local LLMs, some facts some fiction

The deployment path that’ll break through in 2024. Plus, checking in on strategies across Big Tech and AI leaders.

www.interconnects.ai

0

1

4

Interconnects

@interconnectsai

3 months

Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between An interview I've wanted to bring you for a while.

Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding...

An interview I've wanted to bring you for a while.

www.interconnects.ai

0

4

Interconnects

@interconnectsai

8 months

LLMs are computing platforms This fact is why so many debates around LLMs feel broken, especially moderation.

LLMs are computing platforms

This fact is why so many debates around LLMs feel broken, especially moderation.

www.interconnects.ai

0

3

Interconnects

@interconnectsai

10 months

LLM agents and integration dead-ends: When is GPT4 going to schedule my meetings? What is stopping it?

LLM agents and integration dead-ends

When is GPT4 going to schedule my meetings? What is stopping it?

www.interconnects.ai

0

2

3

Interconnects

@interconnectsai

5 months

Interconnects year in review: 2023 The core themes of ML and the blog this year. What changes in 2024.

Interconnects year in review: 2023

The core themes of ML and the blog this year. What changes in 2024.

www.interconnects.ai

0

1

3

Interconnects

@interconnectsai

2 months

We disagree on what open-source AI should mean ... and that's okay. How to read what multiple people mean by the word openness and see through the PR speak.

Why we disagree on what open-source AI should be

How to read what multiple people mean by the word openness and see through the PR speak.

www.interconnects.ai

1

3

Interconnects

@interconnectsai

4 months

Multimodal blogging: My AI tools to expand your audience A fun demo on how generative AI can transform content creation, and tools for my fellow writers on Substack!

Multimodal blogging: My AI tools to expand your audience

A fun demo on how generative AI can transform content creation, and tools for my fellow writers on Substack!

www.interconnects.ai

0

2

3

Interconnects

@interconnectsai

9 months

AI researchers' challenges: atomic analogies and strained institutions Checking in on the Oppenheimer comparisons to AI and how AI research has changed in the last few years (focusing on distribution and participation).

AI researchers' challenges: atomic analogies and strained institutions

We need to heal AI research norms, not build a super project. The reflections and reverberations that we're feeling in the AI community from Oppenheimer's quest for the atomic bomb.

www.interconnects.ai

0

3

Interconnects

@interconnectsai

10 months

LLAMA 2: an an incredible open-source LLM An analysis of the model and what it means.

Llama 2: an incredible open LLM

Meta is continuing to deliver high-quality research artifacts and not backing down from pressure against open source.

www.interconnects.ai

0

2

Interconnects

@interconnectsai

6 months

Reckoning with the Shoggoth of AI Culture wars, open letters, new politics, developer days, and everything hidden under the smiling face of RLHF.

Reckoning with the Shoggoth of AI

Culture wars, open letters, new politics, developer days, and everything hidden under the smiling face of RLHF.

www.interconnects.ai

0

1

2

Interconnects

@interconnectsai

4 months

RLHF learning resources in 2024 A list for beginners and wannabe experts and everyone in between.

RLHF learning resources in 2024

A list for beginners and wannabe experts and everyone in between.

www.interconnects.ai

0

2

Interconnects

@interconnectsai

10 months

LLM products: measurement and manipulation Two stories will begin to unfold as the AI capabilities-to-product overhang is reduced.

LLM products: measurement and manipulation

Two stories will begin to unfold as the AI capabilities-to-product overhang is reduced.

www.interconnects.ai

0

1

2

Interconnects

@interconnectsai

10 months

"If it's not fully closed ML, it's open" - is it? A vibe check on the open versus closed LLM debate.

"If it's not fully closed ML, it's open" - is it?

Definitions from open-source software are being bent by new machine learning technologies.

www.interconnects.ai

0

1

2

Interconnects

@interconnectsai

11 months

Today (partially $): How LLM based disinformation can go beyond just generations and into distribution. * RL training for nefarious targets, * Moderation & generative text

Disinformation with LLMs: the distribution risk

Why changing the dynamics around the distribution of malicious content is a growing concern in addition to just the generations themselves.

www.interconnects.ai

1

2

Interconnects

@interconnectsai

9 months

Cruise's collisions and adapting to AI SF continues to be the center of attention for developments in AI, but this time it's in the physical world.

Cruise's collisions and adapting to AI

SF continues to be the center of attention for developments in AI, but this time it's in the physical world.

www.interconnects.ai

0

1

Interconnects

@interconnectsai

5 months

Audio!

Interconnects Audio | Mixtral: The best open model, MoE trade-offs, release lessons, Mistral raises...

(some buggy audio in this one, from MoE rather than Mixtral lol)Mixtral: The best open model, MoE trade-offs, release lessons, Mistral raises $400mil, Google's loss, vibes vs marketingWe have an am...

podcast.interconnects.ai

0

1

Interconnects

@interconnectsai

8 months

How the open-source LLM ecosystem & leaderboards work: No, SemiAnalysis, HuggingFace isn't misleading all of open-source, and open-source is still making real progress.

How the open-source LLM ecosystem & leaderboards work

No, SemiAnalysis, HuggingFace isn't misleading all of open-source, and open-source is still making real progress.

www.interconnects.ai

0

1