Nomic AI Profile Banner
Nomic AI Profile
Nomic AI

@nomic_ai

13,991
Followers
50
Following
98
Media
707
Statuses

Building explainable and accessible AI

Joined April 2022
Don't wanna be here? Send us removal request.
Pinned Tweet
@nomic_ai
Nomic AI
3 months
Announcing Nomic Embed v1.5 🪆🪆🪆 - Variable sized embeddings with matryoshka learning and an 8192 context. - Outperforms OpenAI text-embedding-3-small across output sizes. - Open source, open training code, open data. Day 0 in @LangChainAI , @llama_index and @MongoDB
11
84
476
@nomic_ai
Nomic AI
1 year
Today we're releasing GPT4All, an assistant-style chatbot distilled from 430k GPT-3.5-Turbo outputs that you can run on your laptop.
44
348
2K
@nomic_ai
Nomic AI
3 months
Introducing Nomic Embed - the first fully open long context text embedder to beat OpenAI - Open source, open weights, open data - Beats OpenAI text-embeding-3-small and Ada on short and long context benchmarks - Day 1 integrations with @langchain , @llama -index, @MongoDB
38
276
2K
@nomic_ai
Nomic AI
1 year
first?
Tweet media one
23
140
1K
@nomic_ai
Nomic AI
11 months
Local LLMs now have plugins! 💥 GPT4All LocalDocs allows you chat with your private data! - Drag and drop files into a directory that GPT4All will query for context when answering questions. - Supports 40+ filetypes - Cites sources.
43
181
820
@nomic_ai
Nomic AI
10 months
GPT4All now supports Text Embeddings ⚡ - Generate text embeddings of arbitrary length documents for free on CPU at 8,000 tok/second. - No external dependencies except C.
Tweet media one
23
145
674
@nomic_ai
Nomic AI
2 years
How do people use #stablediffusion ? Explore 6.4 million AI generated images from @krea_ai in Atlas.
13
163
630
@nomic_ai
Nomic AI
1 year
Big New Release of GPT4All📶 You can now use local CPU-powered LLMs through a familiar API! Building with a local LLM is as easy as a 1 line code change! Simply spin up the chat app at and place it in server mode! Documentation:
Tweet media one
19
145
605
@nomic_ai
Nomic AI
11 months
GPT4All now supports 100+ more models!💥 Nearly every custom ggML model you find @huggingface for CPU inference will *just work* with all GPT4All software with the newest release! Instructions:
Tweet media one
10
92
413
@nomic_ai
Nomic AI
1 year
Huge Release of GPT4All 💥 Powerful LLM's just got faster! - Anyone can try @MosaicML 's new MPT model on their desktop! No GPU required! - Runs on Windows/Mac/Ubuntu Try it at:
14
79
363
@nomic_ai
Nomic AI
6 months
Private and Local Chat with Your Data is here! Use any Local LLM with GPT4All LocalDocs to chat with your large collections of PDFs, docx files including financial documents! - No internet required - Supports 40 filetypes! - CPU and GPU Try LocalDocs in
9
56
362
@nomic_ai
Nomic AI
6 months
Monthly reminder for everyone affected by today's @OpenAI outage: Local #GPT4All models like @MistralAI 7B run at 20tokens/sec+ on a Macbook air and don't go down.
Tweet media one
10
34
304
@nomic_ai
Nomic AI
11 months
The first GPT4All powered code copilot has launched🖥️ @morph_labs allows you to use the recently released Replit GPT4All model on Apple Metal to perform privacy aware - Code completion (23 tok/second) - Chatting and asking questions all through the Rift VSCode extension. Local…
@morph_labs
Morph
11 months
The future of AI code assistants is open-source, private, secure, and on-device. That future starts today. We’re excited to release Rift, an open-source AI-native language server and VSCode extension for local copilots.
27
215
1K
6
57
301
@nomic_ai
Nomic AI
10 months
We’ve just raised a $17m Series A round led by @Coatue to build explainable and accessible AI. Join us: Here is what this means for the future of AI: 🧵
Tweet media one
15
42
273
@nomic_ai
Nomic AI
11 months
Local LLMs in GPT4All are now 2x faster on Apple Silicone ⚡ - Supports all LLaMa models - Exclusive support of the Replit model for 23 tok/s code generation enabling local Copilot! Watch the 13B parameter Hermes model run at 15 tok/s locally!
12
42
248
@nomic_ai
Nomic AI
1 year
What are the latest research trends in AI? Explore all NeurIPS submissions from 1987 to 2022 in Atlas.
3
77
233
@nomic_ai
Nomic AI
1 year
What goes viral on Twitter? Share what you find in our new map of the top 5.4M retweeted tweets. How it works 👇
5
48
221
@nomic_ai
Nomic AI
7 months
Releasing GPT4All v2.5.0 with GGUF Support - Runs @MistralAI 7B Locally with Vulkan GPU Support - Universal GPU Inference: Mistral, LLaMa, MPT, Falcon in Chat Client and Python - Generate Embed4All Embeddings on GPU. See release notes at
6
38
211
@nomic_ai
Nomic AI
2 months
Atlas Capability Announcement: Scalable Duplicate Detection 🍡 - Deduplicate your text, image and embedding datasets in your web browser. - Scales to millions of datapoints (e.g. English Wikipedia) - Cross correlate with real-time regex search and semantic lasso's.
4
33
201
@nomic_ai
Nomic AI
11 months
Local LLMs now support Falcon and Orca 🕊️🐳 GPT4All now supports - @TIIuae Falcon model on any CPU device - OpenLLaMA models (Orca) on Apple Metal
6
31
171
@nomic_ai
Nomic AI
1 year
that escalated quickly:
Tweet media one
1
16
165
@nomic_ai
Nomic AI
2 years
Explorable Map of @Wikipedia . Published at VisXAI. #ieeevis
2
61
167
@nomic_ai
Nomic AI
1 year
First, we collected a training dataset of 1 million prompt-response pairs from GPT-3.5-Turbo on a variety of topics. We are publicly releasing all of this data alongside GPT4All.
3
18
152
@nomic_ai
Nomic AI
9 months
Interact with 11M multimodal embeddings 🗺️ OBELICS: The training set of @huggingface 's new multimodal model IDEFICS.
Tweet media one
1
41
143
@nomic_ai
Nomic AI
2 months
New #GPT4All release - instantly search, download, and privately chat with any model on @huggingface !
4
23
144
@nomic_ai
Nomic AI
8 months
Run LLMs on Any GPU with GPT4All ⚡ - Supports all modern @AMD , @intel , @Qualcomm and @nvidia GPUs for quantized LLM inference. - Faster than OpenCL on modern Nvidia GPUs - Works out of the box on Windows, OSX and Linux. Details below.
3
35
139
@nomic_ai
Nomic AI
3 months
Open source models are not replicable unless you have access to their training data. We release our training dataset of 235M curated text pairs to enable anyone to replicate Nomic Embed from scratch. Blog:
Tweet media one
2
22
138
@nomic_ai
Nomic AI
2 months
btw - you can just do >0 on the nomic-embed-text-v1.5 outputs to get binary embeddings that maintain 90%+ of the FP32 MTEB performance ;) get the model on @huggingface :
@Nils_Reimers
Nils Reimers
2 months
🚀 𝐂𝐨𝐡𝐞𝐫𝐞 𝐄𝐦𝐛𝐞𝐝 𝐕𝟑 - 𝐢𝐧𝐭𝟖 & 𝐛𝐢𝐧𝐚𝐫𝐲 𝐒𝐮𝐩𝐩𝐨𝐫𝐭🚀 I'm excited to launch our native support for int8 & binary embeddings for Cohere Embed V3. They slash your vector DB cost 4x - 32x while keeping 95% - 100% of the search quality.
Tweet media one
14
73
437
2
21
105
@nomic_ai
Nomic AI
20 days
run @Meta 's llama3 privately on your machine with #gpt4all . try it now at
3
25
105
@nomic_ai
Nomic AI
3 months
New Release GPT4All: v2.7.0 This version has support for a wide range of new model architectures as well as many bug fixes. - Baichuan, BLOOM, CodeShell, GPT-2, Orion, Persimmon, Phi and Phi-2, Plamo, Qwen, Qwen2, Refact, and StableLM
Tweet media one
5
9
88
@nomic_ai
Nomic AI
1 year
lil bit of prompt leak?
Tweet media one
1
0
81
@nomic_ai
Nomic AI
10 months
GPT4All now support Wizard 1.1 on Mac Metal @WizardLM_AI
Tweet media one
2
15
75
@nomic_ai
Nomic AI
1 year
Next, we used Atlas to curate the data. We removed low diversity responses, and ensured that the training data covered a variety of topics. Explore the full train set on Atlas:
Tweet media one
1
8
72
@nomic_ai
Nomic AI
3 months
You can find the model on @huggingface : The easiest way to use Nomic Embed in a managed service is through the Nomic Embedding API:
2
10
67
@nomic_ai
Nomic AI
3 months
Model details: - 137M parameters for easy deployment - 5 days of 8xH100 time to train - Code and data: - Detailed Technical Report:
Tweet media one
1
9
66
@nomic_ai
Nomic AI
6 months
What can you learn about a multimodal LLM from its training data? Read about how @huggingface evaluated and improved their multimodal LLM IDEFICS with Nomic Atlas
1
18
62
@nomic_ai
Nomic AI
1 year
#gpt4all + @Replit = giving a voice to the next billion creators
@chillzaza_
Zahid Khawaja
1 year
Running open source LLMs on @Replit feels like magic 🪄 Here's a demo of GPT4All 🦙 by @nomic_ai running inside my Repl:
60
85
583
2
10
60
@nomic_ai
Nomic AI
1 year
normal response:
Tweet media one
2
0
55
@nomic_ai
Nomic AI
1 year
We then benched our trained model against the best open source alpaca-lora we could find on @huggingface (tloen/alpaca-lora-7b by @ecjwg ). Our model achieves consistently lower perplexity!
Tweet media one
1
2
46
@nomic_ai
Nomic AI
1 year
Nomic is proud to support efforts democratizing access to large language models. We believe that open source models are critical to advancing AI research, particularly in the fields of AI interpretability and alignment.
1
1
45
@nomic_ai
Nomic AI
1 year
One of our discord community members benched #GPT4All on trivia, and it ended up beating all other models, including GPT-3.5! Check out the evaluation here:
Tweet media one
3
5
47
@nomic_ai
Nomic AI
1 year
#GPT4All is now in langchain! Open source stands together.
@LangChainAI
LangChain
1 year
Rather large 🦜🔗0.0.131 release! 🆓GPT4all model ( @nomic_ai ) 🦙Llama-cpp model ⏹️Support for @qdrant_engine local db 🌲Zilliz cloud ( @milvusio ) Vectorstore support 📧New OutlookMessage Document Loader 🕸️New Selenium Document Loader 🪟 Support for SQL views in SQLChain 🧵
12
84
590
3
9
46
@nomic_ai
Nomic AI
3 months
Nomic Embed v1.5 is on @huggingface and compatible with SentenceTransformers:
3
7
43
@nomic_ai
Nomic AI
3 months
Native GPT4All Integration Chat with your data locally powered by Nomic Embed.
Tweet media one
3
4
41
@nomic_ai
Nomic AI
1 year
herewego 🚀
Tweet media one
1
4
41
@nomic_ai
Nomic AI
1 year
The bravest souls at the NYC gen AI hackathon - 4am and still going strong
Tweet media one
2
7
38
@nomic_ai
Nomic AI
1 year
This project would not be possible without the incredible effort of @Yuvaaa___ , @zach_nussbaum , @benmschmidt , and @andriy_mulyar ! Read our full technical report here:
3
1
37
@nomic_ai
Nomic AI
11 months
LocalDocs enables any GPT4All model to cite its sources. When GPT4All decides that it can improve response factuality by using your documents it does so and tells you which documents it used.
Tweet media one
3
3
35
@nomic_ai
Nomic AI
1 year
Lots more to come from local chatbots. Also hiring very soon 👀👀👀
@andriy_mulyar
AndriyMulyar
1 year
Announcing GPT4All-J: The First Apache-2 Licensed Chatbot That Runs Locally on Your Machine💥 Large Language Models must be democratized and decentralized.
84
627
3K
5
2
34
@nomic_ai
Nomic AI
3 months
Embedding evaluation is broken. Benchmarks like MTEB are not sufficient for capturing all aspects of model behavior. You can discover systematic differences in model embedding spaces using Nomic Atlas Comparing nomic-embed-text-v1 and OpenAI Ada 002 embeddings.…
5
2
35
@nomic_ai
Nomic AI
10 months
We are proud to announce today that we are partnering with @HuggingFace , the north star of open source machine learning. Together, we are creating and distributing rich, interactive data visualizations to help everyone understand the data going into their AI systems.
1
4
34
@nomic_ai
Nomic AI
1 year
Awesome competition! If GPT4All ends up winning, Nomic will spend the $1M on the greatest hackathon ever thrown. All open source contributors invited.
@chai_research
CHAI: AI Platform
1 year
Announcing the Guanaco LLM Challenge $1 million cash prize, starts June 10th 2023. Find out more at #hacking #aicompetition
2
17
78
2
3
33
@nomic_ai
Nomic AI
9 months
Explore Multimodal Embeddings with Nomic AI and Google Vertex ( @googlecloud )
Tweet media one
1
6
34
@nomic_ai
Nomic AI
4 months
How do you get 10 million documents of text ready for generative AI model training? Join the first episode of the Nomic Atlas Webinar Series to learn! January 19th, 12PM EST
1
6
33
@nomic_ai
Nomic AI
1 year
Your data never leaves your machine! The HTTP server runs on port 4981 (1984 in reverse)! See it in action and own your large language models!
4
5
32
@nomic_ai
Nomic AI
3 months
Day 1 Integrations: - Build a RAG app with Nomic Embed, @MongoDB and @NousResearch : - Build a fully open retriever with Nomic Embed and @llamaindex : - Integrated with @langchain
1
1
32
@nomic_ai
Nomic AI
1 year
@gdb Traditional methods for manual inspection of unstructured data are tedious - a web interface that shows you all of your data pre-organized makes it easy. This is exactly why we are focused on scaling Atlas to support internet scale datasets. Better data curation = better ML.
Tweet media one
1
1
32
@nomic_ai
Nomic AI
10 months
Under the hood, GPT4All now supports the contrastively trained models for inference. This enables anyone to deploy powerful text embeddings models at no cost.
3
2
29
@nomic_ai
Nomic AI
1 year
Download, try-out and build with the best local LLM's at @MosaicML 's MPT model is now a powerful open-source option! Software ecosystem:
Tweet media one
3
6
30
@nomic_ai
Nomic AI
11 months
Install the universal local LLM client from , go to settings and enable the plugin! Documentation: You will soon be able to use LocalDocs in server mode allowing you to easily augment any LLM with your private data.
1
3
30
@nomic_ai
Nomic AI
2 years
interactive map of #iclr2023 submissions 👀 @hippopedoid
Tweet media one
7
11
28
@nomic_ai
Nomic AI
3 months
Interested in learning more about Nomic Embed, the first truly open source model to outperform OpenAI? Nomic sat down with @mattturck on @FirstMarkCap 's MAD podcast to talk all things Nomic, Atlas, and Embed.
2
6
27
@nomic_ai
Nomic AI
3 months
We also launch the Nomic Embedding API - 1M Free tokens! - Production ready embedding inference API including task specific embedding customizations. - Deep integration with Atlas Datasets - New models incoming 👀 Sign up at
Tweet media one
1
0
25
@nomic_ai
Nomic AI
1 year
clean data > big data
@graceisford
Grace Isford
1 year
and would add -- *where* the data is coming from - need more good data for training beyond what's on web
1
0
10
5
1
23
@nomic_ai
Nomic AI
11 months
Check out all of the growing software ecosystem for local LLMs at Run GPT4All in: - Python - Typescript - Golang - C# and .NET
3
2
24
@nomic_ai
Nomic AI
2 years
@krea_ai How it works 👇 Every point is a user-generated image and its prompt. Points are close together if an AI considers their images similar. For example, Billionaires Row is a region containing co-located generations of @elonmusk , Jeff Bezos, Mark Zuckerburg and US dollars.
3
1
24
@nomic_ai
Nomic AI
1 year
Excited to see #gpt4all as the top project on this list! Thrilled to keep building this community with awesome open source companies like @huggingface . Congrats on the milestone!
@huggingface
Hugging Face
1 year
🤗 Transformers has been built by, with, and for the community. Reaching 100k ⭐ on GitHub is a testament to ML's reach and the community's will to innovate and contribute. To celebrate, we highlight 100 incredible projects in transformers' vicinity.
Tweet media one
99
271
1K
0
2
24
@nomic_ai
Nomic AI
11 months
Run local LLMs on CPU in Java with GPT4All ☕ credit to elite hacker: @FZaslavskiy
Tweet media one
0
6
22
@nomic_ai
Nomic AI
1 month
shout out @AntimetalCloud for the free tier slices as a service. also shout out @dingboard_ for the mug.
Tweet media one
0
0
23
@nomic_ai
Nomic AI
11 months
To make this possible, GPT4All hackers had to implement several custom Apple Metal kernels for LLM ops (e.g. Alibi) and support a custom fork llama.cpp! Excited to get these changes upstream!
1
6
22
@nomic_ai
Nomic AI
3 months
The input token embedding space of nomic-embed-text-v1
0
4
22
@nomic_ai
Nomic AI
6 months
GPT4All will auto-index your arbitrary corpus of documents with a simple desktop GUI allowing you to securely and privately chat with your data. Switch between any local LLM including @MistralAI ! Learn more in the documentation:
1
3
21
@nomic_ai
Nomic AI
11 months
Excited to have our Falcon ggML implementation included in llama.cpp! Kudos to elite hacker @apage43
@TheBlokeAI
Tom Jobbins
11 months
We have first Falcon 40B GGML support! Thanks to the amazing efforts of @apage43 , Jan Ploski et al at Support is *experimental*. Won't work with UIs etc. Here's two Falcon 40B models in GGML: Pls read README!
4
29
203
1
2
19
@nomic_ai
Nomic AI
22 days
Want to hear what's in store for local AI in 2024? Join the GPT4All 2024 Townhall tomorrow at 12pm EST.
Tweet media one
1
3
18
@nomic_ai
Nomic AI
11 months
@jayecreates The larger models in the GPT4All ecosystem are not to bad at coding assistance. For code writing, we have optimized replits model to run on CPU and that will be joining the ecosystem when this PR is tested on all operating systems and merges.
3
3
20
@nomic_ai
Nomic AI
11 months
- You can side-load almost any local LLM (GPT4All supports more than just LLaMa) - Everything runs on CPU - yes it works on your computer! - Dozens of developers actively working on it squash bugs on all operating systems and improve the speed and quality of models
9
0
20
@nomic_ai
Nomic AI
1 year
Air gap the chatbots.
@JayaGup10
Jaya Gupta
1 year
💀
Tweet media one
13
13
93
0
0
19
@nomic_ai
Nomic AI
8 months
1
2
19
@nomic_ai
Nomic AI
1 year
Spot on.
@Dorialexander
Alexander Doria
1 year
So it was high time for a new update of the political compass of AI. Now featuring Alpaca (Stanford), @nomic_ai with GPT4All and BloombergGPT
Tweet media one
3
14
53
1
4
16
@nomic_ai
Nomic AI
29 days
GPT4All Townhall Announcement: 2024 Roadmap Date: April 18th, 12pm EST - Outline and discuss the 2024 Roadmap - Opportunity for preview and feedback on GPT4All's next generation local LLM interface.
Tweet media one
2
2
18
@nomic_ai
Nomic AI
10 months
@FardeemM OpenAI is usually worse in both resource usage and performance (e.g. retrieval) Source: the author of SBERT who is now at @cohere
@Nils_Reimers
Nils Reimers
2 years
GPT-3 Embeddings by @OpenAI was announced this week. 📈 I was excited and tested them on 20 datasets 😢 Sadly they are worse than open models that are 1000 x smaller 💰 Running @OpenAI models can be a 1 million times more expensive
Tweet media one
Tweet media two
40
438
2K
3
0
18
@nomic_ai
Nomic AI
11 months
Previously, constant breaking changes to llama.cpp made it difficult to find software that works with any off-the-shelf local LLM. The GPT4All ecosystem will now dynamically load the right versions without any intervention! LLMs should *just work*!
2
1
17
@nomic_ai
Nomic AI
10 months
Our product GPT4All let's anyone run powerful large language models on resource constrained devices. With this new funding, we are doubling down on our commitment to open-source and excited to accelerate work with our industry partners such as open-source giant @MongoDB .
2
1
16
@nomic_ai
Nomic AI
11 months
GPT4All lets you run 13 officially supported models and side load hundreds from @huggingface . The following architectures are supported in GGML. - LLaMA - MPT - Falcon
Tweet media one
0
3
16
@nomic_ai
Nomic AI
6 months
Figure 3: The training set of an open-source multimodal LLM. featuring model creator: @SanhEstPasMoi
Tweet media one
0
1
16
@nomic_ai
Nomic AI
27 days
nomic embed text 1.5 has done this since feb
@mixedbreadai
mixedbreadai
27 days
Follow-up on binary embeddings: 64 bytes per embedding, yee-haw 🤠 Reduces memory usage of our embedding model by more than 98% (64x) while retaining over 90% of model performance with binary 🪆 Model: Blog:
2
22
107
2
2
16
@nomic_ai
Nomic AI
1 year
Cool to see the Nomic shout out in @StabilityAI 's StableLM readme! I do hope they didn't fine tune directly on the chosen responses of the @AnthropicAI HH dataset though...
Tweet media one
@StabilityAI
Stability AI
1 year
Announcing StableLM❗ We’re releasing the first of our large language models, starting with 3B and 7B param models, with 15-65B to follow. Our LLMs are released under CC BY-SA license. We’re also releasing RLHF-tuned models for research use. Read more→
Tweet media one
73
930
4K
1
2
15
@nomic_ai
Nomic AI
11 months
Huge shout out to @amasad and the team @Replit for building and open sourcing the base model!
0
0
15
@nomic_ai
Nomic AI
10 months
Current AI technology is concentrated in the hands of a very small number of companies with outsized access to compute resources. We are changing that: Powerful models should be accessible to all: The tools to build them as well:
2
2
15
@nomic_ai
Nomic AI
2 years
Large Language Models like #T5 and #GPT3 are evaluated on benchmark datasets such as (super) GLUE. But what's in them? Explore and search the contents of the GLUE benchmark with Atlas. QNLI: @sleepinyourhat
1
2
15
@nomic_ai
Nomic AI
3 months
@EssentialBusin7 @llama @MongoDB Each dot is a piece of data. Two dots are close if they have similar embeddings. At Nomic we believe that it is organic - embeddings are a function of meaning, meaning is a function of culture, and culture is a function of organic animal behavior.
2
0
14