@cyrusofeden
Cyrus
2 months
My hunch is that an LLM on its own will not be AGI. But that applying LLMs in new architectures will achieve high performance. We’re at the beginning of a Cambrian explosion of language programs. And DSPy is a tool to help you explore that design space
@lateinteraction
Omar Khattab
2 months
This is a new paradigm, but it takes work. There are huge steps going from Deep Learning to Transformers to GPT-4. Similarly, there's a lot of work to go from Language Programs (DSPy) to the eventual kinds of AI systems this will lead to. I'm working on that & more of us should
1
4
41
2
1
8

Replies

@themitak
Mitko
2 months
@cyrusofeden Interesting! What are some examples of DSPy outperforming other techniques? Or an underpowered LLM+DSPy outperforming a stronger LLM?
1
0
0
@cyrusofeden
Cyrus
2 months
@lateinteraction
Omar Khattab
8 months
A cool thread yesterday used GPT4 ($50), a 500-word ReAct prompt, and ~400 lines of code to finetune Llama2-7B to get 26% HotPotQA EM. Let's use 30 lines of DSPy—without any hand-written prompts or any calls to OpenAI ($0)—to teach a 9x smaller T5 (770M) model to get 39% EM! 🧵
Tweet media one
19
141
972
1
0
1
@jrysana
John
2 months
@cyrusofeden ASI will look like an LLM from the outside but it will not be just an LLM on the inside
0
0
2