Interconnects Profile Banner
Interconnects Profile
Interconnects

@interconnectsai

2,131
Followers
1
Following
2
Media
68
Statuses

What you need to know about AI research trends, from @natolambert Wednesday mornings weekly, sometimes extra posts.

Joined June 2023
Don't wanna be here? Send us removal request.
Pinned Tweet
@interconnectsai
Interconnects
10 months
If you’re a student and want to read paid posts, contact @natolambert by email or DM. Happy to provide a base 80%+ discount.
2
1
6
@interconnectsai
Interconnects
6 months
Synthetic data: Anthropic’s CAI, from fine-tuning to pretraining, OpenAI’s Superalignment, tips, types, and open examples Synthetic data is the accelerator of the next phase of AI — what it is and what it means.
2
24
124
@interconnectsai
Interconnects
4 months
Model merging lessons in The Waifu Research Department When what seems like pure LLM black magic is actually supported by the literature.
2
12
68
@interconnectsai
Interconnects
2 months
DBRX: The new best open model and Databricks’ ML strategy Databricks’ new model is surpassing the performance of Mixtral and Llama 2 70B while still being in a size category that's reasonably accessible.
2
12
59
@interconnectsai
Interconnects
5 months
State-space LLMs: Do we need Attention? Mamba, StripedHyena, Based, research overload, and the exciting future of many LLM architectures all at once.
0
6
24
@interconnectsai
Interconnects
6 months
RLHF progress: Scaling DPO to 70B, DPO vs PPO update, Tülu 2, Zephyr-β, meaningful evaluation, data contamination Huge steps forward in confirming that RLHF can really help you on vibes based evaluation, among many other RLHF analyses.
1
2
34
@interconnectsai
Interconnects
5 months
Big Tech's LLM evals are just marketing A PSA everyone needs. The importance of a wait and see attitude when it comes to new models, big and small, open and closed.
1
4
7
@interconnectsai
Interconnects
7 months
RLHF lit. review #1 and missing pieces in RLHF: Looking at the difference between two sets -- what rumors say industry leaders are doing with RLHF and what the literature is up to. A new series studying RLHF literature.
1
8
29
@interconnectsai
Interconnects
5 months
Interviewing Tri Dao and Michael Poli of Together AI on the future of LLM architectures The first Interconnects research interview! We go even further on the promise of state-space models in the emerging LLM market.
0
3
22
@interconnectsai
Interconnects
1 month
Llama 3: Scaling open LLMs to AGI Meta shows that scaling won't be a limit for open LLM players in the near future.
2
4
21
@interconnectsai
Interconnects
7 months
Undoing RLHF and the brittleness of safe LLMs Recent papers show most of the arguments about needing "safety" in releases of open LLM weights are nearly dead in the water. Yes, still release the parameters. Read here:
0
7
20
@interconnectsai
Interconnects
12 days
ChatBotArena: The peoples’ LLM evaluation, the future of evaluation, the incentives of evaluation, and gpt2chatbot What the details tell us about the most in-vogue LLM evaluation tool — and the rest of the field.
0
2
18
@interconnectsai
Interconnects
1 month
The end of the “best open LLM” Modeling the compute versus performance tradeoff of many open LLMs.
1
2
18
@interconnectsai
Interconnects
10 days
OpenAI’s Model (behavior) Spec, RLHF transparency, personalization questions Now we will have some grounding for when weird ChatGPT behaviors are intended or side-effects — shrinking the Overton window of RLHF bugs.
5
11
16
@interconnectsai
Interconnects
4 months
Multimodal LM roundup: Unified IO 2, inputs and outputs, Gemini, LLaVA-RLHF, and RLHF questions A sampling of recent happenings in the multimodal space. Be sure to expect more this year.
0
3
16
@interconnectsai
Interconnects
3 months
How to cultivate a high-signal AI feed Basic tips on how to assess inbound ML content and cultivate your news feed.
2
1
16
@interconnectsai
Interconnects
21 days
Phi 3 and Arctic: Outlier LMs are hints Models that seem totally out of scope from recent open LLMs give us a sneak peek of where the industry will be in 6 to 18 months.
0
3
16
@interconnectsai
Interconnects
2 months
Model commoditization and product moats Where moats are tested now that so many people have trained GPT4 class models. Claude 3, Gemini 1.5, Inflection 2.5, and Mistral Large are here to party.
0
2
16
@interconnectsai
Interconnects
6 months
The DPO debate: Do we need RL for RLHF? Direct vs. RL methods for preferences, more RLHF models, and hard truths in open RLHF work. We have more questions than answers.
2
3
12
@interconnectsai
Interconnects
2 months
Evaluations: Trust, performance, and price (bonus, announcing RewardBench) Evaluation is not only getting harder with modern LLMs getting more complicated, it’s getting harder because it means something different.
0
3
15
@interconnectsai
Interconnects
5 months
Mixtral Round-up: MoE trade-offs, release lessons, Mistral raises $400mil, Google's loss, vibes vs marketing Emergency blog 🚨 We have an amazing open mixture of experts model for the holidays!
1
3
13
@interconnectsai
Interconnects
1 month
We don’t need to reinvent everything to solve alignment Integrating some non-computing science into reinforcement learning from human feedback (RLHF) can give us the models we want. Bonus: OLMo 1.7-7B.
1
2
12
@interconnectsai
Interconnects
5 months
It's 2024 and they just want to learn The state of the ML communities big and small starting 2024. My general expectations for the year.
0
3
12
@interconnectsai
Interconnects
7 months
The AI research job market shit show (and my experience) There are plenty of jobs, but finding a place where you're happy is as hard as ever. Read here:
0
2
11
@interconnectsai
Interconnects
4 months
Open Language Models (OLMos) and the LLM landscape A small model at the beginning of big changes.
0
0
9
@interconnectsai
Interconnects
3 months
OpenAI’s Sora for video, Gemini 1.5's infinite context, and a secret Mistral model 🚨 Emergency blog! Three things you need to know from the ML world that arrived on Thursday.
0
3
7
@interconnectsai
Interconnects
7 months
Open LLM company playbook Where does releasing model weights fit into company strategy? 3 requirements, 3 actions, and 3 benefits of being in the open LLM space.
0
5
9
@interconnectsai
Interconnects
27 days
AGI is what you want it to be Certain definitions of AGI are backing people into a pseudo-religious corner.
0
2
8
@interconnectsai
Interconnects
6 months
OpenAI’s shakeup and the left turn in the narrative New timelines that emerge in AI and the winners and losers, regardless of the unfolding details.
Tweet media one
0
1
7
@interconnectsai
Interconnects
6 months
The Q* hypothesis Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data 🚨 Emergency special: The information we need to understand what Q* is was right in front of us, but the memes are more fun than reality.
0
1
7
@interconnectsai
Interconnects
10 months
LLM agents follow-up: exploration, RLHF, and more: How does autonomy of language models relate to data collection. [partially $]
0
2
6
@interconnectsai
Interconnects
6 months
The interface era of AI Modern LLMs are becoming the easiest and most efficient way to access information. This will change how we see the world.
0
0
6
@interconnectsai
Interconnects
3 months
The koan of an open-source LLM A proposal for a new definition of an “open-source” LLM and why no definition will ever just work.
0
2
6
@interconnectsai
Interconnects
3 months
Why reward models are key for alignment In an era dominated by direct preference optimization and LLM-as-a-judge, why do we still need a model to output only a scalar reward?
1
0
6
@interconnectsai
Interconnects
8 months
Midjourney vs. Ideogram, ML product companies, preventing AI winter, by @natolambert The coming image-generation battle and its implications on ML product longevity.
0
1
5
@interconnectsai
Interconnects
3 months
10 Sora and Gemini 1.5 follow-ups: code-base in context, deepfakes, pixel-peeping, inference costs, and more The cutting edge technical discussions beneath the wow factor.
1
2
5
@interconnectsai
Interconnects
3 months
Google ships it: Gemma open LLMs and Gemini backlash Google rejoins the open model party and gets some backlash for a frequent problem for generative AI.
0
2
5
@interconnectsai
Interconnects
3 months
Alignment-as-a-service: Scale AI vs. the new guys Scale’s making over $750 million per year selling data for RLHF, who’s coming to take it?
0
1
5
@interconnectsai
Interconnects
5 months
Where 2024’s “open GPT4” can’t match OpenAI’s And why the comparisons don't really matter. Repeated patterns in the race for reproducing ChatGPT, another year of evaluation crises, and people who will take awesome news too far.
0
3
5
@interconnectsai
Interconnects
8 months
Can robotics take off like GenAI? Moravec's paradox vs. scaling laws, by @natolambert . Arguments in the literature for and against rapid progress in robotic learning research.
0
0
4
@interconnectsai
Interconnects
8 months
Open, general-purpose LLM companies might not be viable Failure modes on the quest to open-source LLMs (coming from someone who wants openness). Expect pivots to specialized models.
0
1
4
@interconnectsai
Interconnects
19 days
How RLHF works, part 2: A thin line between useful and lobotomized Many, many signs of life for preference fine-tuning beyond spoofing chat evaluation tools.
0
1
4
@interconnectsai
Interconnects
4 months
Local LLMs, some facts some fiction The deployment path that’ll break through in 2024. Plus, checking in on strategies across Big Tech and AI leaders.
0
1
4
@interconnectsai
Interconnects
3 months
Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between An interview I've wanted to bring you for a while.
0
0
4
@interconnectsai
Interconnects
8 months
LLMs are computing platforms This fact is why so many debates around LLMs feel broken, especially moderation.
0
0
3
@interconnectsai
Interconnects
10 months
LLM agents and integration dead-ends: When is GPT4 going to schedule my meetings? What is stopping it?
0
2
3
@interconnectsai
Interconnects
5 months
Interconnects year in review: 2023 The core themes of ML and the blog this year. What changes in 2024.
0
1
3
@interconnectsai
Interconnects
2 months
We disagree on what open-source AI should mean ... and that's okay. How to read what multiple people mean by the word openness and see through the PR speak.
1
1
3
@interconnectsai
Interconnects
4 months
Multimodal blogging: My AI tools to expand your audience A fun demo on how generative AI can transform content creation, and tools for my fellow writers on Substack!
0
2
3
@interconnectsai
Interconnects
9 months
AI researchers' challenges: atomic analogies and strained institutions Checking in on the Oppenheimer comparisons to AI and how AI research has changed in the last few years (focusing on distribution and participation).
0
0
3
@interconnectsai
Interconnects
6 months
Reckoning with the Shoggoth of AI Culture wars, open letters, new politics, developer days, and everything hidden under the smiling face of RLHF.
0
1
2
@interconnectsai
Interconnects
4 months
RLHF learning resources in 2024 A list for beginners and wannabe experts and everyone in between.
0
0
2
@interconnectsai
Interconnects
10 months
LLM products: measurement and manipulation Two stories will begin to unfold as the AI capabilities-to-product overhang is reduced.
0
1
2
@interconnectsai
Interconnects
10 months
"If it's not fully closed ML, it's open" - is it? A vibe check on the open versus closed LLM debate.
0
1
2
@interconnectsai
Interconnects
11 months
Today (partially $): How LLM based disinformation can go beyond just generations and into distribution. * RL training for nefarious targets, * Moderation & generative text
1
2
2
@interconnectsai
Interconnects
9 months
Cruise's collisions and adapting to AI SF continues to be the center of attention for developments in AI, but this time it's in the physical world.
0
1
1
@interconnectsai
Interconnects
8 months
How the open-source LLM ecosystem & leaderboards work: No, SemiAnalysis, HuggingFace isn't misleading all of open-source, and open-source is still making real progress.
0
0
1