Announcing Nomic Embed v1.5 🪆🪆🪆
- Variable-sized embeddings via Matryoshka representation learning and an 8192-token context.
- Outperforms OpenAI text-embedding-3-small across output sizes.
- Open source, open training code, open data.
Day 0 in @LangChainAI, @llama_index, and @MongoDB
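As a quick illustration of what Matryoshka embeddings buy you, here is a minimal sketch of truncating a full-width vector to a smaller size. The 768 and 256 dimensions are illustrative, and the model card documents the exact recipe (it includes a normalization step), so treat this as the core idea rather than the official pipeline.

```python
# Hedged sketch: resizing a Matryoshka-trained embedding by truncation.
# Assumes `full` is a float32 vector from a model like nomic-embed-text-v1.5;
# the 768 -> 256 sizes are illustrative, not prescriptive.
import numpy as np

def resize_embedding(vec: np.ndarray, dim: int) -> np.ndarray:
    """Keep the first `dim` components and re-normalize to unit length."""
    small = vec[:dim]
    return small / np.linalg.norm(small)

full = np.random.randn(768).astype(np.float32)  # stand-in for a real embedding
small = resize_embedding(full, 256)
print(small.shape)  # (256,)
```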
Introducing Nomic Embed - the first fully open long-context text embedder to beat OpenAI
- Open source, open weights, open data
- Beats OpenAI text-embedding-3-small and Ada on short and long context benchmarks
- Day 1 integrations with @langchain, @llama_index, and @MongoDB
Local LLMs now have plugins! 💥
GPT4All LocalDocs lets you chat with your private data!
- Drag and drop files into a directory that GPT4All will query for context when answering questions.
- Supports 40+ filetypes
- Cites sources.
GPT4All now supports Text Embeddings ⚡
- Generate text embeddings of arbitrary-length documents for free on CPU at 8,000 tok/second.
- No external dependencies except C.
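For reference, local embedding through the gpt4all Python bindings looks roughly like the sketch below; the exact package version determines which embedding model is downloaded on first use.

```python
# Minimal sketch of local, CPU-only text embedding with the gpt4all bindings.
from gpt4all import Embed4All

embedder = Embed4All()  # fetches the bundled embedding model on first use
vector = embedder.embed("The quick brown fox jumps over the lazy dog")
print(len(vector))  # dimensionality of the returned embedding
```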
Big New Release of GPT4All📶
You can now use local CPU-powered LLMs through a familiar API!
Building with a local LLM is as easy as a one-line code change! Simply spin up the chat app at and place it in server mode!
Documentation:
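The "familiar API" is an OpenAI-style HTTP endpoint served by the chat app. A hedged sketch, assuming the default local port (4891) and treating the model name as a placeholder for whatever the server has loaded; check the GPT4All docs for your exact values.

```python
# Hedged sketch: talking to a GPT4All chat app running in server mode.
# The base_url port and model name are assumptions; see the GPT4All docs.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:4891/v1", api_key="not-needed")
resp = client.chat.completions.create(
    model="gpt4all-j",  # placeholder: whichever model the local server loaded
    messages=[{"role": "user", "content": "Hello from my own machine!"}],
)
print(resp.choices[0].message.content)
```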
GPT4All now supports 100+ more models!💥
Nearly every custom GGML model you find on @huggingface will *just work* for CPU inference with all GPT4All software in the newest release!
Instructions:
Huge Release of GPT4All 💥
Powerful LLMs just got faster!
- Anyone can try @MosaicML's new MPT model on their desktop! No GPU required!
- Runs on Windows/Mac/Ubuntu
Try it at:
Private and Local Chat with Your Data is here!
Use any local LLM with GPT4All LocalDocs to chat with your large collections of PDFs and docx files, including financial documents!
- No internet required
- Supports 40 filetypes!
- CPU and GPU
Try LocalDocs in
Monthly reminder for everyone affected by today's @OpenAI outage: Local #GPT4All models like @MistralAI 7B run at 20+ tokens/sec on a MacBook Air and don't go down.
The first GPT4All-powered code copilot has launched🖥️
@morph_labs lets you use the recently released Replit GPT4All model on Apple Metal to perform privacy-aware
- Code completion (23 tok/second)
- Chatting and asking questions
all through the Rift VSCode extension.
Local…
The future of AI code assistants is open-source, private, secure, and on-device. That future starts today. We’re excited to release Rift, an open-source AI-native language server and VSCode extension for local copilots.
We’ve just raised a $17M Series A round led by @Coatue
to build explainable and accessible AI.
Join us:
Here is what this means for the future of AI: 🧵
Local LLMs in GPT4All are now 2x faster on Apple Silicon ⚡
- Supports all LLaMA models
- Exclusive support of the Replit model for 23 tok/s code generation, enabling local Copilot!
Watch the 13B parameter Hermes model run at 15 tok/s locally!
Releasing GPT4All v2.5.0 with GGUF Support
- Runs @MistralAI 7B locally with Vulkan GPU support
- Universal GPU inference: Mistral, LLaMA, MPT, Falcon in Chat Client and Python
- Generate Embed4All Embeddings on GPU.
See release notes at
Atlas Capability Announcement: Scalable Duplicate Detection 🍡
- Deduplicate your text, image and embedding datasets in your web browser.
- Scales to millions of datapoints (e.g. English Wikipedia)
- Cross-correlate with real-time regex search and semantic lassos.
First, we collected a training dataset of 1 million prompt-response pairs from GPT-3.5-Turbo on a variety of topics. We are publicly releasing all of this data alongside GPT4All.
Run LLMs on Any GPU with GPT4All ⚡
- Supports all modern @AMD, @intel, @Qualcomm, and @nvidia GPUs for quantized LLM inference.
- Faster than OpenCL on modern Nvidia GPUs
- Works out of the box on Windows, OSX and Linux.
Details below.
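In the Python bindings, GPU selection is a constructor argument. A sketch, assuming a quantized model file such as the Mistral 7B GGUF named below (the filename is illustrative):

```python
# Sketch of Vulkan-backed GPU inference via the gpt4all Python bindings.
# The model filename is illustrative; device="gpu" requests a GPU backend.
from gpt4all import GPT4All

model = GPT4All("mistral-7b-instruct-v0.1.Q4_0.gguf", device="gpu")
print(model.generate("Why is the sky blue?", max_tokens=64))
```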
Open source models are not replicable unless you have access to their training data.
We release our training dataset of 235M curated text pairs to enable anyone to replicate Nomic Embed from scratch.
Blog:
btw - you can just do >0 on the nomic-embed-text-v1.5 outputs to get binary embeddings that maintain 90%+ of the FP32 MTEB performance ;)
Get the model on @huggingface:
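The ">0" trick is literally a thresholding of the float outputs. A minimal sketch, with placeholder vectors standing in for real nomic-embed-text-v1.5 outputs:

```python
# The ">0" binarization trick from the post, in a few lines of NumPy.
# `emb` is a placeholder for real float32 nomic-embed-text-v1.5 outputs.
import numpy as np

emb = np.random.randn(4, 768).astype(np.float32)  # stand-in embeddings
binary = emb > 0                                   # boolean/bit embeddings
# Binary vectors are compared with Hamming distance instead of cosine:
hamming = np.count_nonzero(binary[0] != binary[1])
print(hamming)
```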
🚀 Cohere Embed V3 - int8 & binary Support 🚀
I'm excited to launch our native support for int8 & binary embeddings for Cohere Embed V3.
They slash your vector DB cost 4x - 32x while keeping 95% - 100% of the search quality.
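A hedged sketch of requesting compressed vector types from the Cohere Python SDK; the model name and `embedding_types` values follow Cohere's published docs, and the response layout may differ across SDK versions.

```python
# Hedged sketch: asking Cohere Embed v3 for int8 and binary vectors.
# Parameter names follow Cohere's docs; response shape varies by SDK version.
import cohere

co = cohere.Client("YOUR_API_KEY")
resp = co.embed(
    texts=["hello world"],
    model="embed-english-v3.0",
    input_type="search_document",
    embedding_types=["int8", "ubinary"],
)
print(resp.embeddings.int8[0][:8])  # first few int8 components
```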
New Release GPT4All: v2.7.0
This version has support for a wide range of new model architectures as well as many bug fixes.
- Baichuan, BLOOM, CodeShell, GPT-2, Orion, Persimmon, Phi and Phi-2, Plamo, Qwen, Qwen2, Refact, and StableLM
Next, we used Atlas to curate the data. We removed low diversity responses, and ensured that the training data covered a variety of topics. Explore the full train set on Atlas:
What can you learn about a multimodal LLM from its training data?
Read about how @huggingface evaluated and improved their multimodal LLM IDEFICS with Nomic Atlas
We then benchmarked our trained model against the best open-source alpaca-lora we could find on @huggingface (tloen/alpaca-lora-7b by @ecjwg). Our model achieves consistently lower perplexity!
Nomic is proud to support efforts democratizing access to large language models. We believe that open source models are critical to advancing AI research, particularly in the fields of AI interpretability and alignment.
One of our Discord community members benchmarked #GPT4All on trivia, and it ended up beating all other models, including GPT-3.5!
Check out the evaluation here:
Rather large 🦜🔗0.0.131 release!
🆓 GPT4All model (@nomic_ai)
🦙 Llama-cpp model
⏹️ Support for @qdrant_engine local db
🌲 Zilliz cloud (@milvusio) Vectorstore support
📧New OutlookMessage Document Loader
🕸️New Selenium Document Loader
🪟 Support for SQL views in SQLChain
🧵
LocalDocs enables any GPT4All model to cite its sources.
When GPT4All decides that it can improve response factuality by using your documents, it does so and tells you which documents it used.
Announcing GPT4All-J: The First Apache-2 Licensed Chatbot That Runs Locally on Your Machine💥
Large Language Models must be democratized and decentralized.
Embedding evaluation is broken. Benchmarks like MTEB are not sufficient for capturing all aspects of model behavior.
You can discover systematic differences in model embedding spaces using Nomic Atlas
Comparing nomic-embed-text-v1 and OpenAI Ada 002 embeddings.…
We are proud to announce today that we are partnering with @HuggingFace, the north star of open source machine learning. Together, we are creating and distributing rich, interactive data visualizations to help everyone understand the data going into their AI systems.
We're excited to announce that the Nomic Vulkan backend is now merged into @ggerganov's llama.cpp under an MIT license!
Run open-source LLMs on nearly any GPU.
How do you get 10 million documents of text ready for generative AI model training?
Join the first episode of the Nomic Atlas Webinar Series to learn!
January 19th, 12PM EST
@gdb Traditional methods for manual inspection of unstructured data are tedious - a web interface that shows you all of your data pre-organized makes it easy. This is exactly why we are focused on scaling Atlas to support internet-scale datasets. Better data curation = better ML.
Under the hood, GPT4All now supports the contrastively trained models for inference.
This enables anyone to deploy powerful text embeddings models at no cost.
Install the universal local LLM client from , go to settings and enable the plugin!
Documentation:
You will soon be able to use LocalDocs in server mode allowing you to easily augment any LLM with your private data.
Interested in learning more about Nomic Embed, the first truly open source model to outperform OpenAI?
Nomic sat down with @mattturck on @FirstMarkCap's MAD podcast to talk all things Nomic, Atlas, and Embed.
We're also launching the Nomic Embedding API
- 1M free tokens!
- Production-ready embedding inference API, including task-specific embedding customizations.
- Deep integration with Atlas Datasets
- New models incoming 👀
Sign up at
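Calling the hosted API through the `nomic` Python client looks roughly like this sketch (it assumes you've already authenticated, e.g. via `nomic login`); the `task_type` argument is the task-specific customization mentioned above.

```python
# Sketch of the hosted Nomic Embedding API via the `nomic` Python client.
# Assumes prior authentication (e.g. `nomic login` with an API key).
from nomic import embed

out = embed.text(
    texts=["Nomic Embed is fully open source."],
    model="nomic-embed-text-v1",
    task_type="search_document",  # task-specific embedding customization
)
print(len(out["embeddings"][0]))
```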
@krea_ai How it works 👇
Every point is a user-generated image and its prompt.
Points are close together if an AI considers their images similar.
For example, Billionaires Row is a region containing co-located generations of @elonmusk, Jeff Bezos, Mark Zuckerberg, and US dollars.
Excited to see #gpt4all as the top project on this list! Thrilled to keep building this community with awesome open source companies like @huggingface. Congrats on the milestone!
🤗 Transformers has been built by, with, and for the community.
Reaching 100k ⭐ on GitHub is a testament to ML's reach and the community's will to innovate and contribute.
To celebrate, we highlight 100 incredible projects in transformers' vicinity.
To make this possible, GPT4All hackers had to implement several custom Apple Metal kernels for LLM ops (e.g. Alibi) and support a custom fork of llama.cpp!
Excited to get these changes upstream!
GPT4All will auto-index your arbitrary corpus of documents with a simple desktop GUI allowing you to securely and privately chat with your data.
Switch between any local LLM, including @MistralAI!
Learn more in the documentation:
We have the first Falcon 40B GGML support!
Thanks to the amazing efforts of @apage43, Jan Ploski, et al. at
Support is *experimental*. Won't work with UIs etc.
Here are two Falcon 40B models in GGML:
Please read the README!
@jayecreates The larger models in the GPT4All ecosystem are not too bad at coding assistance. For code writing, we have optimized Replit's model to run on CPU, and it will join the ecosystem once this PR is tested on all operating systems and merged.
- You can side-load almost any local LLM (GPT4All supports more than just LLaMA)
- Everything runs on CPU - yes, it works on your computer!
- Dozens of developers are actively working on it, squashing bugs on all operating systems and improving model speed and quality
GPT4All Townhall Announcement: 2024 Roadmap
Date: April 18th, 12pm EST
- Outline and discuss the 2024 Roadmap
- Opportunity for preview and feedback on GPT4All's next generation local LLM interface.
GPT-3 Embeddings by @OpenAI were announced this week.
📈 I was excited and tested them on 20 datasets
😢 Sadly they are worse than open models that are 1000x smaller
💰 Running @OpenAI models can be 1 million times more expensive
Previously, constant breaking changes to llama.cpp made it difficult to find software that works with any off-the-shelf local LLM.
The GPT4All ecosystem will now dynamically load the right versions without any intervention! LLMs should *just work*!
Our product GPT4All lets anyone run powerful large language models on resource-constrained devices.
With this new funding, we are doubling down on our commitment to open-source and are excited to accelerate work with our industry partners such as open-source giant @MongoDB.
GPT4All lets you run 13 officially supported models and side-load hundreds from @huggingface. The following architectures are supported in GGML:
- LLaMA
- MPT
- Falcon
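Side-loading in the Python bindings is just pointing the constructor at your own model file. A hedged sketch; the filename and directory below are placeholders for a model you downloaded yourself:

```python
# Hedged sketch: side-loading a local model file with the gpt4all bindings.
# The filename and directory are placeholders, not shipped artifacts.
from gpt4all import GPT4All

model = GPT4All(
    model_name="ggml-model-q4_0.bin",  # any supported LLaMA/MPT/Falcon file
    model_path="/path/to/my/models",
    allow_download=False,              # use the local file, don't fetch
)
print(model.generate("Say hi in one sentence.", max_tokens=32))
```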
Follow-up on binary embeddings: 64 bytes per embedding, yee-haw 🤠
Reduces memory usage of our embedding model by more than 98% (64x) while retaining over 90% of model performance with binary 🪆
Model:
Blog:
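The arithmetic behind "64 bytes": binarize a 512-dimension Matryoshka slice at one bit per dimension and bit-pack it, so 512 / 8 = 64 bytes. A minimal sketch with a placeholder vector:

```python
# How 64 bytes per embedding works out: binarize, then bit-pack.
# A 512-dim slice at 1 bit/dimension packs into 512 / 8 = 64 bytes.
import numpy as np

emb = np.random.randn(512).astype(np.float32)  # placeholder 512-dim slice
packed = np.packbits(emb > 0)                  # uint8 array of packed bits
print(packed.nbytes)  # 64
```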
Cool to see the Nomic shout-out in @StabilityAI's StableLM readme!
I do hope they didn't fine-tune directly on the chosen responses of the @AnthropicAI HH dataset though...
Announcing StableLM❗
We’re releasing the first of our large language models, starting with 3B and 7B param models, with 15-65B to follow. Our LLMs are released under CC BY-SA license.
We’re also releasing RLHF-tuned models for research use. Read more→
Current AI technology is concentrated in the hands of a very small number of companies with outsized access to compute resources. We are changing that:
Powerful models should be accessible to all:
The tools to build them as well:
Large Language Models like #T5 and #GPT3 are evaluated on benchmark datasets such as (Super)GLUE. But what's in them?
Explore and search the contents of the GLUE benchmark with Atlas.
QNLI:
Each dot is a piece of data. Two dots are close if they have similar embeddings. At Nomic we believe that it is organic - embeddings are a function of meaning, meaning is a function of culture, and culture is a function of organic animal behavior.