Pan Lu @ ICLR 2024 Profile Banner
Pan Lu @ ICLR 2024 Profile
Pan Lu @ ICLR 2024

@lupantech

4,181
Followers
1,061
Following
177
Media
719
Statuses

PhD @CS_UCLA @uclanlp | Amazon/Bloomberg/Qualcomm/UCLA Fellows | Ex @Tsinghua_Uni @Microsoft @allen_ai | #NLPoc , AI4Math, AI4Science, LLMs, Reasoning

Bay Area
Joined April 2016
Don't wanna be here? Send us removal request.
Pinned Tweet
@lupantech
Pan Lu @ ICLR 2024
3 days
Today, we presented our #MathVista () at #ICLR2024 in Vienna! 🌟 We are thrilled by the tremendous progress in math reasoning in the era of LLMs and VLMs. MathVista has become one of the most reliable benchmarks for probing their abilities in visual math…
Tweet media one
@lupantech
Pan Lu @ ICLR 2024
7 months
🚀Excited to release our 112-page study on math reasoning in visual contexts via #MathVista . For the first time, we provide both quantitative and qualitative evaluations of #GPT4V , #Bard , & 10 other models. 📄✨Full paper: 🔗Proj: …
Tweet media one
16
79
313
5
9
67
@lupantech
Pan Lu @ ICLR 2024
1 year
🔥Excited to release LLaMA-Adapter! With only 1.2M learnable parameters and 52K instruction data, LLaMA-Adapter turns a #LLaMA into an instruction-following model within ONE hour, delivering high-quality responses! 🚀Paper: 🚀Code:
Tweet media one
24
174
820
@lupantech
Pan Lu @ ICLR 2024
11 months
🔥Thrilled to release LLaMa-Adapter Multimodal! 🎯Now supporting text, image, audio, and video inputs powered by #ImageBind . 🧵6 💻Codes for inference, pretraining, and finetuning ➕ checkpoints: demo: abs:
Tweet media one
15
150
646
@lupantech
Pan Lu @ ICLR 2024
11 months
🎉Exciting news: LLaMA-Adapter is now fully unlocked! 🧵6 1⃣ As a general-purpose #multimodal foundation model, it integrates various inputs like images, audio, text, video, and 3D point clouds, while providing image, text-based, and detection outputs. It uniquely accepts the…
Tweet media one
22
166
604
@lupantech
Pan Lu @ ICLR 2024
9 months
🚀Introducing #LLaMA2 -Accessory - an advanced open-source toolkit for large language models. Evolved from LLaMA-Adapter, we now support more datasets, tasks, visual encoders, and efficient optimization methods.🧠 🔗Code: 💡Key Features: 🎯 Pre-training…
Tweet media one
13
133
505
@lupantech
Pan Lu @ ICLR 2024
1 year
🚀65B LLaMA-Adapter-V2 code & checkpoint are NOW ready at ! 🛠️Big update enhancing multimodality & chatbot. 🔥LLaMA-Adapter-V2 surpasses #ChatGPT in response quality (102%:100%) & beats #Vicuna in win-tie-lost (50:14). ☕️Thanks to Peng Gao & @opengvlab ! 2/2
11
104
415
@lupantech
Pan Lu @ ICLR 2024
1 year
🎉New paper! The survey of deep learning for mathematical reasoning ( #DL4MATH ) is now available. We've seen tremendous growth in this community since 2018, and this review covers the tasks, datasets, and methods from the past decade. Check it out now:
Tweet media one
6
79
339
@lupantech
Pan Lu @ ICLR 2024
1 year
LLaMA-Adapter V2, the next-gen multi-modal instruction model, boasts a model size multiple times larger than 7B! 🌟🔥 Chatbot systems, get ready for a major upgrade! 🤖💬 Stay tuned! Technical report & models coming soon. 📄🔜Keep up to date! 🔗
Tweet media one
4
63
314
@lupantech
Pan Lu @ ICLR 2024
7 months
🚀Excited to release our 112-page study on math reasoning in visual contexts via #MathVista . For the first time, we provide both quantitative and qualitative evaluations of #GPT4V , #Bard , & 10 other models. 📄✨Full paper: 🔗Proj: …
Tweet media one
16
79
313
@lupantech
Pan Lu @ ICLR 2024
7 months
🚀 Introducing #SPHINX : The Next-Gen #Multimodal_LLM . Seamlessly blending Tasks, Embeddings & Weights for advanced multimodal reasoning. 🧵N 🔍Demo: 💻Code: What's New with #SPHINX compared to #LLaMA_Adapter ? 🆕 ✅ Powered by the…
Tweet media one
12
66
274
@lupantech
Pan Lu @ ICLR 2024
1 year
🚀Meet Chameleon! An innovative plug-and-play framework enhancing #GPT4 and #ChatGPT like #AutoGPT for compositional reasoning, blending off-the-shelf tools with tailored LLM models 🔧✨🧠. New SOTA on #ScienceQA and TabMWP! 📈 🔗 📜
Tweet media one
14
73
259
@lupantech
Pan Lu @ ICLR 2024
1 year
🚀 Introducing the LLaMA-Adapter, now available on @huggingface ! 🔗 🎉 Feel free to explore and experiment with our LLaMA-Adapter. We're eager to hear your feedback! 💥 Stay tuned for the upcoming second version - even more powerful and feature-packed!
3
41
245
@lupantech
Pan Lu @ ICLR 2024
4 months
🎉 Thrilled to have our MathVista work accepted at #ICLR2024 as an Oral presentation! Explore our work: 🔍 Project: 🤗 @huggingface Dataset @_akhaliq : 💻 Code: Deepest gratitude to our shining team: 👏🌟…
Tweet media one
@lupantech
Pan Lu @ ICLR 2024
7 months
🚀Excited to release our 112-page study on math reasoning in visual contexts via #MathVista . For the first time, we provide both quantitative and qualitative evaluations of #GPT4V , #Bard , & 10 other models. 📄✨Full paper: 🔗Proj: …
Tweet media one
16
79
313
7
33
248
@lupantech
Pan Lu @ ICLR 2024
1 month
I am thrilled to defend my PhD and finally earn the title of Doctor🧑‍🎓. It's been a truly rewarding journey at @UCLAComSci . I'm so fortunate and grateful for the invaluable mentorship from Prof. @kaiwei_chang @uclanlp . He has always been incredibly encouraging, helpful, and…
@kaiwei_chang
Kai-Wei Chang
1 month
Congrats 🎉 to the newly titled Dr. Lu @lupantech on defending his thesis about mathematical reasoning with language models"! 🧮 Pan has published a series of works on quantifying and improving math and scientific reasoning ability in LLMs. Some highlights:
1
5
81
42
2
232
@lupantech
Pan Lu @ ICLR 2024
1 year
🔥Boost your GPT-3 with our ICLR-23 paper on PromptPG! The first of its kind, PromptPG uses RL to select optimal examples for GPT-3, leading to a 5.31% gain on the TabMWP dataset of math word problems. Don't miss out on this game-changing solution! 👉 🧵1/7
2
30
227
@lupantech
Pan Lu @ ICLR 2024
2 months
🔍 Does Multi-modal LLMs Truly Understand Diagrams in Visual Math Problems? 🧐 Interest in visual math reasoning has surged in the era of Multi-modal LLMs ( #MLLMs ). Although showing promising potential, it remains uncertain whether MLLMs utilize visual or textual shortcuts to…
Tweet media one
@_akhaliq
AK
2 months
MathVerse Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? The remarkable progress of Multi-modal Large Language Models (MLLMs) has garnered unparalleled attention, due to their superior performance in visual contexts. However, their capabilities in
Tweet media one
1
72
260
1
34
213
@lupantech
Pan Lu @ ICLR 2024
6 months
🔥 Introducing #SPHINX 🦁: an all-in-one multimodal LLM with a unified interface that seamlessly integrates domains, tasks, & embeddings. 🧵N 👋 Explore the @Gradio demo @_akhaliq : Dive into the open resources! 🤗 Model @huggingface :…
Tweet media one
13
53
212
@lupantech
Pan Lu @ ICLR 2024
9 months
🎉 Just reached 1000 citations on Google Scholar! Grateful to be part of a community that values and engages with my research. Here's to continued curiosity and exploration! 🔍
Tweet media one
7
0
189
@lupantech
Pan Lu @ ICLR 2024
7 months
🤔 Ever wondered why foundation models like LLMs & LMMs are only tested on textual math reasoning benchmarks? 🔍 Dive into our #MathVista for a fresh perspective: ! 🌟 Introducing #MathVista : A groundbreaking benchmark for visual mathematical reasoning –…
Tweet media one
Tweet media two
Tweet media three
13
49
186
@lupantech
Pan Lu @ ICLR 2024
1 year
🌟Last week, I am honored to present our latest work #Chameleon to the Reasoning Team at Google Brain @DeepMind . It's encouraging to witness tool-augmented LLMs like Transformer Agents @huggingface and Chameleon garnering significant attention. 🧵6 Slides:
Tweet media one
Tweet media two
Tweet media three
Tweet media four
4
33
166
@lupantech
Pan Lu @ ICLR 2024
4 months
Model editing has been an effective way to reduce hallucinations in LLMs, instead of undergoing resource-intensive retraining. 🤯However, our study, led by @JasonForJoy , @kaiwei_chang , & @VioletNPeng , reveals that current methods inadvertently impair the general skills of LLMs.…
Tweet media one
1
30
159
@lupantech
Pan Lu @ ICLR 2024
2 years
🚨Struggling to select examples for GPT-3? Try our PromptPG, the first work that applies RL to select in-context examples for GPT-3! PromptPG achieves a gain of 5.31% on TabMWP, a new dataset of tabular math word problems! Check out data and codes:👇 🧵1/7
2
22
154
@lupantech
Pan Lu @ ICLR 2024
2 years
🚨Thrilled to have one paper accepted to #NeurIPS2022 ! We construct a new benchmark, ScienceQA, and design language models to learn to generate lectures and explanations as the chain of thought to mimic the multi-hop reasoning process. Data and code will be coming soon!
Tweet media one
Tweet media two
Tweet media three
2
14
146
@lupantech
Pan Lu @ ICLR 2024
2 years
📢📢Excited to have one paper accepted to #NeurIPS2022 ! We present a new dataset, ScienceQA, and develop large language models to learn to generate lectures and explanations as the chain of thought (CoT). Data and code are public now! Please check👇👇
Tweet media one
Tweet media two
Tweet media three
Tweet media four
4
27
145
@lupantech
Pan Lu @ ICLR 2024
7 months
🔥 Exciting Update! We've manually evaluated #GPT4V using the playground chatbot on #MathVista , our newest benchmark for visual mathematical reasoning. 🚀 #GPT4V soared with a 15.1%⬆️ improvement over #Bard , setting a new record at 49.9%! 🎉 🌐 Yet,…
Tweet media one
3
27
135
@lupantech
Pan Lu @ ICLR 2024
1 year
Our #Chameleon ranked #1 among 1682 AI papers last week by @alphasignalai , emphasizing the significant impact our work has made. #Chameleon is a plug-and-play reasoning framework, enabling LLMs to utilize diverse tools. 🔗 🎉 More:
Tweet media one
1
35
132
@lupantech
Pan Lu @ ICLR 2024
10 months
🤖 Could #LLMs develop emotional intelligence to undestand human social interactions? Introducing KokoMind 🦍: a benchmark to evaluate how #gpt4 , #chatgpt , & #claude interpret conversations and relations, and contribute with insightful advices. 💥 Demo:
Tweet media one
@shi_weiyan
Weiyan Shi
10 months
Put ChatGPT at a cocktail party🥂. Can it - understand people's conversations, gestures - figure out their relations, - and even chime in with social advice? 🦍Announce KokoMind. 🌟Check out this demo! More at #AI #GPT4 #ChatGPT #OpenAI #Shrinking 🧵
13
87
303
4
27
129
@lupantech
Pan Lu @ ICLR 2024
2 months
🚀🎉 Introducing X-Accessory's new member: Large Diffusion Transformer (Large-DiT)! 🎆✨ 🔗 💪 We're pushing boundaries by expanding diffusion transformers to 7B parameters. Here are our features: 🧵6 1⃣ Model Scaling-up 📈: Scale to 3B and 7B by merging…
Tweet media one
7
20
98
@lupantech
Pan Lu @ ICLR 2024
2 years
Can machines answer multi-modal math word problems? We proposed a new task, Icon Question Answering #IconQA , to deal with it! Details are available below: Paper: Project: Code:
Tweet media one
Tweet media two
Tweet media three
3
25
96
@lupantech
Pan Lu @ ICLR 2024
5 months
Tweet media one
0
3
94
@lupantech
Pan Lu @ ICLR 2024
5 months
📢 Can't wait to see you at the 3rd #MathAI Workshop in the LLM Era at #NeurIPS2023 ! ⏰ 8:55am - 5:00pm, Friday, Dec 15 📍 Room 217-219 🔗 📽️ Exciting Lineup: ⭐️ Six insightful talks by @KristinLauter , @BaraMoa , @noahdgoodman ,…
Tweet media one
4
20
88
@lupantech
Pan Lu @ ICLR 2024
1 month
Excited to announce the AI for Math Workshop at #ICML2024 @icmlconf ! Join us for groundbreaking discussions on the intersection of AI and mathematics. 🤖🧮 📅 Workshop details: 📜 Submit your pioneering work: 🏆 Take on our…
Tweet media one
Tweet media two
2
15
88
@lupantech
Pan Lu @ ICLR 2024
2 months
🤖In sciences and finance, we often engage in statistical and causal reasoning with structured data. Ever dreamed of #LLMs doing the heavy lifting, clearing the path from the maze of complex and error-prone tasks? 🤯 Hold that thought! 🛑 Our findings reveal that even GPT-4…
Tweet media one
@xxxxiaol
Xiao Liu
2 months
Are LLMs Capable of Data-based Statistical and Causal Reasoning? In this work, we propose a benchmark QRData (Quantitative Reasoning with Data) to evaluate models' capability in statistical and causal reasoning with real-world data. 🌐:
Tweet media one
1
24
81
0
21
87
@lupantech
Pan Lu @ ICLR 2024
6 months
I am honored to win the @Qualcomm Innovation Fellowship! A heartfelt thank you to @kaiwei_chang for your kind words and encouragement. I am grateful to our team, including @liujc1998 and Professor @HannaHajishirzi . This achievement wouldn't have been possible without you all! ❤️
@uclanlp
uclanlp
6 months
Congrats @lupantech for winning the 2023 Qualcomm Innovation Fellowship! 🐻 Pan is a rock star in math and scientific reasoning in NLP!
0
3
20
3
5
86
@lupantech
Pan Lu @ ICLR 2024
1 year
🔥Thrilled to announce that our LLaMA-Adapter has been featured in Lit-LLaMA by @LightningAI 🦙🦙 🚀 Check out our LLaMA-Adapter here: ⚡️ Explore Lit-LLaMA on GitHub:
@LightningAI
Lightning AI ⚡️
1 year
Progress update!🦙🔥🤓 Lit-LLaMA now implements the LLaMA-Adapter method for efficient fine-tuning 🔧⚡️ The core idea can be implemented in about 11 lines of code🤯 (see screenshot) Link to repo👉 Link to Adapter paper👉
Tweet media one
2
41
170
2
12
85
@lupantech
Pan Lu @ ICLR 2024
5 months
💥💥Update Alert! Radar graphs & leaderboard on #MathVista now feature detailed scores for the #Gemini family models. 🚀 🔍 Insight: Gemini Ultra leads the pack, outperforming GPT-4V by 3.1%! Yet, each model shines uniquely in various math reasoning & visual contexts. 🙏 Big…
Tweet media one
Tweet media two
2
16
83
@lupantech
Pan Lu @ ICLR 2024
11 months
Privileged to have the opportunity to guest lecture on #NLP course @CS_UCLA , instructed by Prof. @kaiwei_chang . I really enjoyed it and am so glad to share recent advancements in mathematical reasoning and commonsense reasoning.🧵3 🔗Check out the slides:
Tweet media one
4
7
79
@lupantech
Pan Lu @ ICLR 2024
11 months
Excited to explore my research internship @MSFTResearch this summer! Cheers!🍻🍻
Tweet media one
0
1
77
@lupantech
Pan Lu @ ICLR 2024
5 months
Hey Friends! 🎉 Excited to be at #NeurIPS2023 ! 🚀 I’ll be presenting a paper 📄, co-organizing the MATH-AI workshop 🧮, and sharing three collaborative projects. Can't wait to meet you in New Orleans 🎭 and explore the AI advancements in math, science, and more! 🤖🧪 👇1⃣2⃣3⃣4⃣…
Tweet media one
1
5
78
@lupantech
Pan Lu @ ICLR 2024
1 year
🦙Please check out LLaMA-Adapter-V2, performing open-ended multi-modal visual instructions by merely introducing 14M learnable parameters over 65B #LLaMA . abs: repo: weights: video:
@lupantech
Pan Lu @ ICLR 2024
1 year
🚀65B LLaMA-Adapter-V2 code & checkpoint are NOW ready at ! 🛠️Big update enhancing multimodality & chatbot. 🔥LLaMA-Adapter-V2 surpasses #ChatGPT in response quality (102%:100%) & beats #Vicuna in win-tie-lost (50:14). ☕️Thanks to Peng Gao & @opengvlab ! 2/2
11
104
415
0
22
78
@lupantech
Pan Lu @ ICLR 2024
5 months
Excited to see the release of Gemini! It is more excited to see that Gemini @google features MathVista for evaluating math reasoning in visual contexts and Geometry3K for evaluating geometry reasoning!! Congratulations and thanks @GoogleDeepMind , @GoogleResearch , and @Google !…
Tweet media one
Tweet media two
@JeffDean
Jeff Dean (@🏡)
5 months
I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks,…
Tweet media one
Tweet media two
276
3K
13K
1
5
75
@lupantech
Pan Lu @ ICLR 2024
9 months
We're organizing the 3rd #MathAI workshop at @NeurIPSConf #NeurIPS . 🚀 Excited for our speakers on AI for mathematical reasoning, @guyvdb , @noahdgoodman , @wtgowers , @BaraMoa , @KristinLauter , @TaliaRinger , @paul_smolensky , Armando Solar-Lezama, @Yuhu_ai_ , @ericxing , @denny_zhou .…
Tweet media one
0
11
69
@lupantech
Pan Lu @ ICLR 2024
1 year
📢Great news! Our #ScienceQA dataset is gaining significant attention lately. It is the primary benchmark for the next-gen #MultimodalCoT reasoning system by @AmazonScience , and it's now included in @huggingface : . More details: 👉
Tweet media one
1
15
67
@lupantech
Pan Lu @ ICLR 2024
1 month
Spent a fantastic weekend at Lake Arrowhead with the @uclanlp group! ❄️🏔️⬆️ Enjoyed scenic drives, delicious meals, engaging conversations, and brainstorming sessions. Truly inspiring! 🚗🥘😋💬 🖼️🧠💡
Tweet media one
2
6
67
@lupantech
Pan Lu @ ICLR 2024
1 year
🌟 Excited about the releases of the #ChatGPT App and #Zelda game? 🚀 Check out the power of our multimodal LLaMA- #Adapter , with a performance that echoes the potential of the visual #GPT4 . 💥 Stay tuned for the upcoming V2 demo, multimodal Arena, checkpoints, and much more!
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
17
61
@lupantech
Pan Lu @ ICLR 2024
2 months
🤯So thrilled to have @AnthropicAI benchmark their latest, powerful Claude 3 models on our #MathVista for visual math reasoning! It's encouraging to see the rapid progress in (multimodal) LLMs, especially in the math and science fields! 💥 🤗 Our @huggingface Data:…
Tweet media one
@AnthropicAI
Anthropic
2 months
Today, we're announcing Claude 3, our next generation of AI models. The three state-of-the-art models—Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku—set new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision.
Tweet media one
559
2K
10K
1
7
52
@lupantech
Pan Lu @ ICLR 2024
1 year
🔥Thrilled to see our #LLaMA -Adapter featured in @HuggingFace 's "Spaces of the Week"! 🎉 Introducing LLaMA-Adapter V2, our cutting-edge multi-modal instruction model! Explore demo examples here: 💡 🚀Stay tuned for the technical report and model release!
Tweet media one
Tweet media two
0
10
51
@lupantech
Pan Lu @ ICLR 2024
2 years
It has been a wonderful day at Open House @allen_ai 🍺🍖🌊. I met a lot of great people and got inspiring advice. Many thanks to the great efforts of the operations team for preparing all of it!
Tweet media one
Tweet media two
0
2
49
@lupantech
Pan Lu @ ICLR 2024
6 months
🚀 Our @Gradio demo now supports diverse vision-language tasks: 1️⃣ Visual Question Answering (VQA) 2️⃣ Multi-level Dense Caption 3️⃣ Referring Expression Comprehension 4️⃣ Relationship Grounding 5️⃣ Grounding Captions 6️⃣ Object Detection 7️⃣ Human Keypoint Detection 8️⃣ Text Detection…
Tweet media one
0
11
49
@lupantech
Pan Lu @ ICLR 2024
6 months
Deeply honored to have won the @Qualcomm Innovation Fellowship this year. It fills me with immense pride to be a part of the @CS_UCLA community.
@UCLAComSci
UCLA Computer Science
6 months
PhD Student Pan Lu Wins 2023 Qualcomm Innovation Fellowship Read more:
0
0
6
8
1
47
@lupantech
Pan Lu @ ICLR 2024
1 year
🌟Powered by #DALLE2 , #LLM unveils the potential for Multimodal Procedural Planning (MPP): generating coherent and authentic multimodal plans with multiple steps to reach high-level goals. Explore our latest work: abs: data & code:
Tweet media one
1
11
48
@lupantech
Pan Lu @ ICLR 2024
25 days
🎉 Exciting news! Our #MathVista is excelling with the latest advances in vision-language models (VLMs). Grok-1.5V by @xai achieves a 52.8% score, surpassing leading models such as GPT-4V, Claude 3 Opus, and Gemini Pro 1.5! 🔗 Visit our project page: 👀…
Tweet media one
@xai
xAI
28 days
👀
621
1K
7K
1
4
46
@lupantech
Pan Lu @ ICLR 2024
5 months
Congratulations and thanks to @MistralAI for releasing the #MoE model to the community. Our LLaMA2-Accessory now features Mixtral-8x7b with a chatbot demo, available on @Gradio ! Try the Chatbot: http://106.14.127.192/ For more implementation details: 📖 Documentation:…
Tweet media one
0
10
43
@lupantech
Pan Lu @ ICLR 2024
7 months
📢 Attention #NLPoc community! Submit and showcase your research at the 4th Southern California Natural Language Symposium (SoCal NLP) 📜 🗓️ Submission Deadline: Oct. 21, 2023, 11:59 PM PT 🔗 More info: #SoCalNLP #CallForPapers
Tweet media one
1
13
45
@lupantech
Pan Lu @ ICLR 2024
1 year
Thanks for sharing our work! 🦙🍻
@_akhaliq
AK
1 year
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model Compared to the original LLaMAAdapter, LLaMA-Adapter V2 can perform open-ended multi-modal instructions by merely introducing 14M parameters over LLaMA abs: github:
Tweet media one
3
100
344
0
7
44
@lupantech
Pan Lu @ ICLR 2024
5 months
Gratitude to our esteemed speakers, insightful panelists, engaged attendees, and dedicated organizers ( @LiangZhenwen , @AlbertQJiang , @katie_m_collins , @KaiyuYang4 , @wellecks , and @JLMcClelland ) for making the 3rd #MATHAI workshop at #NeurIPS2023 an extraordinary success!!
Tweet media one
@lupantech
Pan Lu @ ICLR 2024
5 months
📢 Can't wait to see you at the 3rd #MathAI Workshop in the LLM Era at #NeurIPS2023 ! ⏰ 8:55am - 5:00pm, Friday, Dec 15 📍 Room 217-219 🔗 📽️ Exciting Lineup: ⭐️ Six insightful talks by @KristinLauter , @BaraMoa , @noahdgoodman ,…
Tweet media one
4
20
88
1
4
42
@lupantech
Pan Lu @ ICLR 2024
10 months
🚀We've just launched #SciBench , a sophisticated, college-level benchmark. It uniquely evaluates the capabilities of LLMs in tackling scientific problem-solving.
@_akhaliq
AK
10 months
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models paper page: Recent advances in large language models (LLMs) have demonstrated notable progress on many mathematical benchmarks. However, most of these…
Tweet media one
2
17
67
1
8
40
@lupantech
Pan Lu @ ICLR 2024
4 months
In 2021, we explored early research in geometry: our Inter-GPS, a neuro-symbolic solver, reached average human-level score for the first time.🎉 Now, @GoogleDeepMind 's AlphaGeometry marks a historic breakthrough: Olympiad-level skill!🚀 🔎For more: 🔗…
Tweet media one
@GoogleDeepMind
Google DeepMind
4 months
Introducing AlphaGeometry: an AI system that solves Olympiad geometry problems at a level approaching a human gold-medalist. 📐 It was trained solely on synthetic data and marks a breakthrough for AI in mathematical reasoning. 🧵
114
1K
4K
1
8
36
@lupantech
Pan Lu @ ICLR 2024
2 years
Happy to receive the NeurIPS 2022 Scholar Award! I really appreciate every support I get from the community, and I will devote myself to making contributions to the community! @NeurIPSConf 🍻See you in New Orleans!
Tweet media one
1
1
38
@lupantech
Pan Lu @ ICLR 2024
5 months
⭐️ Awesome! @guyvdb from UCLA is presenting the talk "AI Can Learn from Data. But Can It Learn to Reason?" offering insights from a logical and probabilistic perspective! #MATHAI #NeurIPS23 #Logic #Reasoning #AI
Tweet media one
@lupantech
Pan Lu @ ICLR 2024
5 months
📢 Can't wait to see you at the 3rd #MathAI Workshop in the LLM Era at #NeurIPS2023 ! ⏰ 8:55am - 5:00pm, Friday, Dec 15 📍 Room 217-219 🔗 📽️ Exciting Lineup: ⭐️ Six insightful talks by @KristinLauter , @BaraMoa , @noahdgoodman ,…
Tweet media one
4
20
88
0
3
36
@lupantech
Pan Lu @ ICLR 2024
10 months
🧲Please stop by our poster on deep learning for math reasoning at Poster Session 2 @aclmeeting #ACL2023NLP . ❤️Thanks to co-authors for their great contributions: @liangqiu_1994 , @wyu_nd , @wellecks , & @kaiwei_chang . abs: github: …
Tweet media one
0
5
34
@lupantech
Pan Lu @ ICLR 2024
6 months
🚀 @google is introducing new updates to aid in learning math and science, especially in visual contexts: . 💥 We're proud to spotlight our commitment to math and science over the past years, with projects like #MathVista , #Chameleon , and #ScienceQA . 1️⃣…
Tweet media one
0
10
33
@lupantech
Pan Lu @ ICLR 2024
5 months
🚨 Attention! I'm presenting the 🦎 #Chameleon paper at Booth 320 from 10:45 to 12:45 at #NeurIPS23 . You're welcome to stop by for a chat! ☕️😉🤖🧲💡 For more details, check out our project at .
Tweet media one
@_akhaliq
AK
1 year
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models Chameleon with GPT-4 achieves an 86.54% accuracy on ScienceQA, significantly improving upon the best published few-shot model by 11.37%; using GPT-4 as the underlying LLM, Chameleon achieves a 17.8%…
Tweet media one
0
101
416
2
3
34
@lupantech
Pan Lu @ ICLR 2024
5 months
It is remarkable that Gemini achieves a new SOTA of 53.0% on MathVista (), a challenging benchmark for math reasoning in visual contexts. We are honored that our proposed #MathVista is advancing the development of the newest and most capable AI models.
@JeffDean
Jeff Dean (@🏡)
5 months
In image understanding, Gemini performs well across all the benchmarks we examined, with the Ultra model setting new state-of-the-art results in every benchmark.
Tweet media one
4
9
192
0
3
34
@lupantech
Pan Lu @ ICLR 2024
11 months
🚀OpenAI is releasing the latest function and tool-calling update for #GPT4 ! Just two months back, we introduced #Chameleon 🦎, an innovative compositional reasoning framework. It uses LLMs as a planner to generate diverse programs, integrating various tools including LLMs,…
Tweet media one
Tweet media two
0
6
33
@lupantech
Pan Lu @ ICLR 2024
1 year
It was great to attend the #NeurIPS2022 poster session and present our work @UCLA @ASU @allen_ai in person🎉. I’m excited that I met many great people and got countless insightful advice and comments. Thanks to everyone for your interest in our work!🍻
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
4
32
@lupantech
Pan Lu @ ICLR 2024
2 years
🎯It is time to submit your work on mathematical reasoning to the 2nd MATH-AI workshop! As the workshop is non-archival, papers that are recently published or under review are allowed. ⏰The submission deadline is due on Sep 29⏰. ✅✅More information:
Tweet media one
Tweet media two
0
7
30
@lupantech
Pan Lu @ ICLR 2024
1 year
Thanks for sharing our latest work on multimodal procedural planning 🍻
@_akhaliq
AK
1 year
Multimodal Procedural Planning via Dual Text-Image Prompting abs: github:
Tweet media one
0
36
129
0
4
30
@lupantech
Pan Lu @ ICLR 2024
1 year
🎉🎉I am really happy that the 2nd MATH-AI workshop ended with such a big success. Very encouraged that so many people are interested in the domain and that the community is growing rapidly. Huge thanks to the speakers, panelists, and organizers! See you all at future events!!🍻
Tweet media one
Tweet media two
Tweet media three
Tweet media four
2
1
30
@lupantech
Pan Lu @ ICLR 2024
5 months
🎉 Exciting News! X-Accessory now welcomes a new addition - Mistral-MoE! 🌟 Discover it here: 🚀 Tap into the power of Mistral-MoE with our X-Accessory's robust framework, with the new features of inference and LoRA fine-tuning via model parallelism. 🌐…
Tweet media one
Tweet media two
0
7
29
@lupantech
Pan Lu @ ICLR 2024
3 months
😜Looking forward to seeing you at the 1st Tool-Augmented Vision (TAVI) Workshop at #CVPR2024 in Seattle. 🔍For more details, please visit the website:
Tweet media one
@ahmetius
Ahmet Iscen
3 months
We will be organizing the 1st Tool-Augmented VIsion (TAVI) Workshop at #CVPR2024 . We are looking forward to having an exciting list of keynote speakers covering various topics about tool-use and retrieval augmented models. More details at:
1
10
33
0
4
29
@lupantech
Pan Lu @ ICLR 2024
1 year
We're dedicated to #OpenSource , confident that it will profoundly enrich the community.🌟 Thrilled to see our recent work, LLaMA-Adapter, and its subsequent developments positively impacting the community.🚀 Stay updated with continuous improvements: 📌
@rasbt
Sebastian Raschka
1 year
It was a great month for open source: So many LLMs came out that it's become quite overwhelming to keep track of it all. So, in this month's Ahead of AI issue, I am sharing resources and research insights on the latest open-source LLMs & datasets!
13
128
550
0
7
26
@lupantech
Pan Lu @ ICLR 2024
2 years
🚨Call for Papers🚨 Submission to the #NeurIPS2022 MATH-AI Workshop will be due on Sep 30, 11:59pm PT (2 days after ICLR😆). The page limit is 4 pages (not much workload🤩). Work both in progress and recently published is allowed. Act NOW and see you in #NewOrleans !🥳🥳🍻
Tweet media one
Tweet media two
Tweet media three
0
9
26
@lupantech
Pan Lu @ ICLR 2024
5 months
One model to align multiple modalities. Looking forward to seeing the live demo.
@_akhaliq
AK
5 months
OneLLM: One Framework to Align All Modalities with Language paper page: Multimodal large language models (MLLMs) have gained significant attention due to their strong multimodal understanding capability. However, existing works rely heavily on…
Tweet media one
5
71
257
0
4
25
@lupantech
Pan Lu @ ICLR 2024
1 year
An excellent blog on Controllable Neural Text Generation from @lilianweng ! It's important to consider ways to reduce the hallucinations of LLMs and better reflect human intentions, especially given their current success and limitations. 👉 #ChatGPT #LLM
0
3
26
@lupantech
Pan Lu @ ICLR 2024
1 year
Thrilled to join the live event, thanks to @LightningAI 's kind invitation! 🌟 Peng and I will share the insights behind the LLaMA-Adapter series. 📅 event: 📚 abs-1: 📚 abs-2: 💻 code:
Tweet media one
0
7
25
@lupantech
Pan Lu @ ICLR 2024
11 months
@kajikent Hi @kajikent , thanks so much for sharing our work! 私たちの作品を共有してくれてありがとう!
1
1
24
@lupantech
Pan Lu @ ICLR 2024
1 year
Excited to be at #AAAI23 on-site! Can't wait to catch up with old friends and make new ones. 📢I'll give an oral presentation on #ScienceQA () at @knowledgenlp Workshop on Monday, Feb 13, 2:15-3:15 pm in Room 144B. If you're around, let's grab a coffee!
Tweet media one
0
1
24
@lupantech
Pan Lu @ ICLR 2024
1 year
📢📢Welcome to the 2nd #MATH -AI workshop tomorrow (Sunday, Dec 03) in Rooms 293-294 at #NeurIPS2022 if you are interested in math reasoning and AI! There are 6 invited talks, 3 contributed talks, 1 poster session, and 1 panel discussion. 🪜Full program:
Tweet media one
0
7
23
@lupantech
Pan Lu @ ICLR 2024
1 year
🔥The ChatGPT API has just been released! #ChatGPT
Tweet media one
1
2
21
@lupantech
Pan Lu @ ICLR 2024
2 months
Excited to see the breakthrough achieved by @Apple 's MM1 model, as evidenced by our #MathVista (), the comprehensive benchmark for math reasoning in visual contexts!
@mckbrando
Brandon McKinzie
2 months
Few-shot mixed-resolution CoT: we can keep the strong few-shot capabilities learned from multimodal pre-training even after instruction-tuning: MM1-30B-Chat achieves 39.4 zero-shot on MathVista, but with eight-shot CoT mixed-resolution prompting we can achieve 44.4.
Tweet media one
1
4
24
0
1
20
@lupantech
Pan Lu @ ICLR 2024
11 months
🧵1/6 Experience the magic of LLaMA-Adapter! Transforming real-world inputs like text, images, videos, audio, and 3D point clouds into engaging text. The reality you know, reimagined through AI. 🖼️📽️🔉🌐➕📝 ➡️➡️🦙➡️➡️ 📝
Tweet media one
Tweet media two
Tweet media three
Tweet media four
2
4
20
@lupantech
Pan Lu @ ICLR 2024
1 year
Had a great time at #SoCalNLP last week. Loving the beautiful and peaceful campus at #UCSB .
Tweet media one
Tweet media two
Tweet media three
0
1
20
@lupantech
Pan Lu @ ICLR 2024
2 years
🧐Looking for a well-designed benchmark for mathematical reasoning? Lila 📜 is your next best option! 🥳🥳
@mattf1n
Matthew Finlayson @ ICLR
2 years
Can a language model help you with your math homework? Not on its own, but maybe with the help of a Python interpreter! In our EMNLP paper we present 📜 Līla and 🤖 Bhāskara, a math reasoning benchmark and model. 📄: 🔗: 1/🧵
Tweet media one
Tweet media two
5
38
214
0
3
18
@lupantech
Pan Lu @ ICLR 2024
1 year
Absolutely thrilled to share that Tony Xia @CS_UCLA has been accepted into @Stanford 's Computer Science MS program! It was an honor to write his recommendation and have mentored such a talented undergraduate since 2020. Wishing him all the best as he pursues his academic dreams.
Tweet media one
0
0
18
@lupantech
Pan Lu @ ICLR 2024
2 years
Excited to organize the 2nd MATHAI workshop @NeurIPSConf with our great team❤️! The workshop will be in New Orleans🏙️ in person, on December 03, 2022. The submission is open now🧲! #NeurIPS2022
@Yuhu_ai_
Yuhuai (Tony) Wu
2 years
🚨We are organizing the 2nd MATHAI workshop at NeurIPS! Check it out if you're interested in AI for math, and machine reasoning in general🤯! We have a great lineup of speakers & panelists! See more in call for papers: 👇
Tweet media one
3
32
150
0
2
18
@lupantech
Pan Lu @ ICLR 2024
1 year
🥳Trilled in New Orleans for #NeurIPS ! This year, I will present one paper (ScienceQA) + 2 WS papers (PromptPG, Lila). And I am co-organizing the 2nd MATH-AI workshop! ☕️Excited to meet you! DM me if you want to grab a coffee and chat about MathAI, LLMs, and trustworthy NLP!!👇
1
1
17
@lupantech
Pan Lu @ ICLR 2024
16 days
An insightful fireside chat by Sam Altman! Looking forward to the potential of generative AI models that facilitate solving the common challenges that all human beings face! #OpenAI #GenAI
Tweet media one
0
0
16
@lupantech
Pan Lu @ ICLR 2024
1 year
Evaluating response quality with GPT-4, LLaMA-Adapter-V2 outshines ChatGPT. It triumphs over #ChatGPT in response quality, scoring 102%:100%! 🚀
Tweet media one
2
5
15
@lupantech
Pan Lu @ ICLR 2024
2 years
🎉YES! It is exciting to see the growing community on Math&AI! Thank the organizing team @Swarooprm7 @wellecks @Yuhu_ai_ @HannaHajishirzi @percyliang for their great efforts to make this happen! 👏👏 The acceptance notification will be announced on October 20. Stay tuned! 😆
@Yuhu_ai_
Yuhuai (Tony) Wu
2 years
Compared to the 1st MATHAI workshop 1 year ago, the number of submissions this time almost doubled! Glad to see the field is growing rapidly 🙌 Also there are many mind-blowing works 🤯🤯 Stay tuned!
1
3
36
0
3
13
@lupantech
Pan Lu @ ICLR 2024
2 years
The data visualization page is now here at . You can play with it now to see what ScienceQA looks like🧐. Data and code will also be ready in the next couple of weeks.🥳
@lupantech
Pan Lu @ ICLR 2024
2 years
🚨Thrilled to have one paper accepted to #NeurIPS2022 ! We construct a new benchmark, ScienceQA, and design language models to learn to generate lectures and explanations as the chain of thought to mimic the multi-hop reasoning process. Data and code will be coming soon!
Tweet media one
Tweet media two
Tweet media three
2
14
146
1
2
14
@lupantech
Pan Lu @ ICLR 2024
5 months
Great time at the Meta party!! 😀✌️🎷💫🍺 #NeurIPS2023
@ylecun
Yann LeCun
5 months
Meta party at #neurips2023 Giant selfie.
Tweet media one
21
18
614
0
0
13
@lupantech
Pan Lu @ ICLR 2024
2 months
🗳️🗳️If you've attended #EMNLP in the past 3 years, please check your email to vote for the SIGDAT VP-elect by 3/24. Your vote is important to thrive the #NLP community!
@kaiwei_chang
Kai-Wei Chang
2 months
I am honored to be nominated by SIGDAT (the org that oversees EMNLP) to run for VP-elect with other awesome candidates who share the goal of improving our community. Please check your email to vote by 3/24.🗳️ See details:
Tweet media one
3
36
134
0
1
13
@lupantech
Pan Lu @ ICLR 2024
1 year
With a mere 1.2 million learnable parameters, LLaMA-Adapter demonstrates superior reasoning capacity on #ScienceQA , surpassing a diverse range of multi-modal and LLM models, such as fully-finetuned MM-COT and few-shot GPT-3.
Tweet media one
1
1
12
@lupantech
Pan Lu @ ICLR 2024
2 years
If you cannot wait😆, an informal version of the paper is available at .
@lupantech
Pan Lu @ ICLR 2024
2 years
🚨Thrilled to have one paper accepted to #NeurIPS2022 ! We construct a new benchmark, ScienceQA, and design language models to learn to generate lectures and explanations as the chain of thought to mimic the multi-hop reasoning process. Data and code will be coming soon!
Tweet media one
Tweet media two
Tweet media three
2
14
146
1
1
13
@lupantech
Pan Lu @ ICLR 2024
8 months
Thrilled to see OmniQuant – a crucial development for compressing large language models! It's astounding that it can quantize 7B-70B LLaMa-2 models in just 1 to 16 hours using 128 samples, and it even supports mobile phones. 🔗 Code:
@opengvlab
OpenGVLab
8 months
Thank AK @_akhaliq for the post. 🔥 Excited to introduce OmniQuant - An advanced open-source algorithm for compressing large language models! 📜 Paper: 🔗 Code: 💡 Key Features: 🚀Omnidirectional Calibration: Enables easier weight…
Tweet media one
2
25
78
0
0
12
@lupantech
Pan Lu @ ICLR 2024
11 months
📽️LLaMa-Adapter Multimodal supports [Video] input. 👀From cinematic masterpieces to topical news footage, it's designed to perceive and appreciate the diverse content in videos. Stay tuned for our live demo: 🧵2/6
Tweet media one
2
2
12
@lupantech
Pan Lu @ ICLR 2024
5 months
Super excited to have @xinyun_chen_ present the great work on Analogical Reasoning with @denny_zhou . Don't miss the insightful talk this afternoon at the MathAI Workshop at #NeurIPS2023 . ⏰ 4:00pm - 4:30pm 📍 Room 217-219
@denny_zhou
Denny Zhou
5 months
A simple yet effective approach to fill the performance gap between zero-shot and few-shot prompting Xinyun Chen @xinyun_chen_ is going to present our recent work LLM analogical reasoning () this afternoon in the exciting #MathAI workshop of #NeurIPS2023 .…
3
20
100
0
0
11
@lupantech
Pan Lu @ ICLR 2024
5 months
Congratulations! Honored to witness the rapid growth of the @UCLAComSci community!
@UCLAengineering
UCLA Samueli Engineering
5 months
Congrats to @UCLA Asst. Prof. @adityagrover_ and incoming Asst. Prof. Saadia Gabriel @GabrielSaadia of @CS_UCLA on being named to @Forbes ' 30 Under 30 list in science. Grover and Gabriel were each recognized for their work using artificial intelligence.
0
4
20
0
0
11
@lupantech
Pan Lu @ ICLR 2024
6 months
💥Congrats to Sean on launching the L3 Lab at CMU! I am honored to have collaborated with him on two papers and co-organized three MathAI workshops. He is definitely the rising star 🚀 in the field, and I have learnt a lot from his great vision and excellent leadership!
@wellecks
Sean Welleck
6 months
Announcing the L3 Lab at CMU! We focus on Learning, Language, and Logic, including: - Principles of ML for language - ML in high-trust areas, such as verifying math and programs - ML systems that improve over time Recruiting PhD students for fall 2024!
Tweet media one
11
95
533
1
0
11
@lupantech
Pan Lu @ ICLR 2024
5 months
Starting Now! Welcome to our first talk by @KristinLauter , Director at FAIR Labs, North America, Meta. She'll be sharing the latest progress in AI4Crypto: Machine Learning attacks on Post-Quantum Crypto schemes! #NeurIPS2023 #MATHAI #AI #Cryptography #QuantumComputing
Tweet media one
@lupantech
Pan Lu @ ICLR 2024
5 months
📢 Can't wait to see you at the 3rd #MathAI Workshop in the LLM Era at #NeurIPS2023 ! ⏰ 8:55am - 5:00pm, Friday, Dec 15 📍 Room 217-219 🔗 📽️ Exciting Lineup: ⭐️ Six insightful talks by @KristinLauter , @BaraMoa , @noahdgoodman ,…
Tweet media one
4
20
88
1
3
11