Philip Vollet

@philipvollet

Followers: 29,533
Following: 5,482
Media: 1,859
Statuses: 16,927

Head of Developer Growth @weaviate_io & Open source lover tweeting about machine learning and data science projects.

Berlin, Germany
Joined November 2011
Pinned Tweet
@philipvollet
Philip Vollet
2 years
When Nvidia said they were sending me a DGX A100 for machine learning, they didn't say I had to build a room for it! Say hello to Zubi & Oompa Loompa 🐈 Btw, these are Maine Coons; they are a bit smaller than tigers. I am not prone to exaggeration. How big was the Cray-1? 🤔
45
24
477
@philipvollet
Philip Vollet
2 years
NN-SVG is a tool for creating Neural Network architecture drawings parametrically rather than manually! It also provides the ability to export those drawings to Scalable Vector Graphics (SVG) files, suitable for inclusion in academic papers or web pages
29
926
4K
@philipvollet
Philip Vollet
2 years
PlotNeuralNet: Use LaTeX for making neural network diagrams!
22
727
3K
@philipvollet
Philip Vollet
2 years
NN-SVG is a tool for creating Neural Network architecture drawings parametrically rather than manually! It also provides the ability to export those drawings to Scalable Vector Graphics (SVG) files, suitable for inclusion in academic papers or web pages
24
477
2K
@philipvollet
Philip Vollet
2 years
The entire Python Data Science Handbook in the form of free Jupyter notebooks!
22
507
2K
@philipvollet
Philip Vollet
2 years
Parsr transforms PDF, documents, and images into enriched structured data
18
347
2K
@philipvollet
Philip Vollet
2 years
handcalcs: a library to render Python calculation code automatically in LaTeX for your Jupyter Notebook, in a manner that mimics handwritten math: write the symbolic formula, followed by numeric substitutions, and then the result.
19
359
2K
@philipvollet
Philip Vollet
2 years
Need to draw your system architecture? Don't want to use slow and expensive Microsoft Visio? Diagrams lets you draw your system architecture in Python code. It was born for prototyping a new system architecture design without any design tools.
23
365
2K
@philipvollet
Philip Vollet
2 years
Book lottery! Machine Learning with PyTorch and Scikit-Learn: Develop machine learning and deep learning models with Python. @rasbt Like and you're in the pool for one of three copies!
28
71
1K
@philipvollet
Philip Vollet
2 years
Thinking about how to visualize text? Here is a huge collection of possibilities for inspiration!
23
285
1K
@philipvollet
Philip Vollet
2 years
This repository contains Jupyter notebooks implementing the code samples found in the book Deep Learning with Python GitHub Book
9
304
1K
@philipvollet
Philip Vollet
3 years
River is a Python library for online machine learning. It is the result of a merger between creme and scikit-multiflow. River's ambition is to be the go-to library for doing machine learning on streaming data.
14
226
1K
@philipvollet
Philip Vollet
2 years
Gooey: Turn (almost) any Python command line program into a full GUI application with one line! pip install Gooey Don't forget to star the repository!
18
207
1K
@philipvollet
Philip Vollet
2 years
Top2Vec is an algorithm for topic modeling and semantic search. It automatically detects topics present in text and generates jointly embedded topic, document and word vectors. GitHub Paper
13
215
1K
@philipvollet
Philip Vollet
3 years
Where to get data for your next machine learning project? An overview of 8 amazing resources to accelerate your next project with data!
- Google Datasets
- Big Bad NLP Datasets
- Hugging Face Datasets
- Papers with Code Datasets
- Open Data on AWS
- Awesome Public Datasets
17
306
1K
@philipvollet
Philip Vollet
3 years
Do you need social media data for your machine learning project? - Twitter data? - Reddit data? - Facebook data? Where to get it?
20
233
916
@philipvollet
Philip Vollet
2 years
Thinking about how to visualize text? Here is a huge collection of possibilities for inspiration!
5
172
912
@philipvollet
Philip Vollet
2 years
Create super fast animated charts for your Jupyter Notebook with ipyvizzu pip install ipyvizzu Don't forget to star the repository!
13
154
831
@philipvollet
Philip Vollet
2 years
Pandas Tutor lets you write Python pandas code in your browser and see how it transforms your data step-by-step.
3
141
776
@philipvollet
Philip Vollet
2 years
Orchest: build data pipelines, the easy way! No frameworks. No YAML. Just write Python and R code in Notebooks. Don't forget to star the repository!
10
135
777
@philipvollet
Philip Vollet
2 years
Objectron: a dataset of 15,000 annotated videos and 4M annotated images! GitHub
0
128
784
@philipvollet
Philip Vollet
4 years
PruneBERT has just been released: save up to 97% of the original parameters and win incredible performance. Paper GitHub #deeplearning #machinelearning #python #nlp #datascience @huggingface
4
233
761
@philipvollet
Philip Vollet
3 years
PlotNeuralNet: LaTeX code for making neural network diagrams
7
145
758
@philipvollet
Philip Vollet
2 years
Using Self-Organizing Maps to solve the Traveling Salesman Problem
18
117
758
@philipvollet
Philip Vollet
3 years
Where to find trending machine learning papers? 3 tools to find what's trending:
12
166
710
@philipvollet
Philip Vollet
2 years
Roadmap to becoming an Artificial Intelligence Expert!
17
193
711
@philipvollet
Philip Vollet
3 years
Draw the architecture in Python code. Prototype a new system architecture design without any design tools. $ pip install diagrams
11
164
719
@philipvollet
Philip Vollet
2 years
Awesome Explainable Graph Reasoning! A collection of research papers and software related to explainability in graph machine learning. @benrozemberczki Don't forget to spend some star love for the repository!
3
130
722
@philipvollet
Philip Vollet
2 years
Schedule your Jupyter Notebooks and send the results as an HTML report! Notebooker executes your Jupyter notebooks when you commit to Git, turning your Jupyter Notebook into a production-style web-based report in a few clicks.
7
154
698
@philipvollet
Philip Vollet
3 years
The unofficial PyTorch implementation of the Attention Free Transformer by Apple Inc. $ pip install aft-pytorch Don't forget to spend some star love for the repository!
6
122
691
@philipvollet
Philip Vollet
9 months
StackOverflow implemented its semantic search solution with Weaviate. How did they do it? They used a pre-trained BERT model from the SentenceTransformers library to generate the embeddings. Their reasons for using Weaviate: it's open source, and you can host it on your own…
14
124
702
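The embedding-based search described in the tweet above can be sketched roughly as follows. This is a minimal illustration, not StackOverflow's or Weaviate's implementation: the three-dimensional vectors are made-up stand-ins for BERT embeddings, and `cosine_similarity`, `DOCS`, and `search` are hypothetical names introduced here.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


# Hypothetical toy "embeddings"; a real system would get these from a
# sentence-embedding model and store them in a vector database.
DOCS = {
    "How do I reverse a list in Python?": [0.9, 0.1, 0.0],
    "What is a segmentation fault in C?": [0.1, 0.9, 0.1],
    "Sorting a Python list in descending order": [0.8, 0.2, 0.1],
}


def search(query_vec, docs, top_k=2):
    """Return the top_k document titles ranked by similarity to the query."""
    ranked = sorted(
        docs,
        key=lambda title: cosine_similarity(query_vec, docs[title]),
        reverse=True,
    )
    return ranked[:top_k]


# The query vector is also a made-up stand-in for an embedded question.
results = search([1.0, 0.0, 0.0], DOCS)
for title in results:
    print(title)
```

Ranking by cosine similarity is one common choice for comparing embeddings; a production vector database typically replaces the linear scan with an approximate nearest-neighbor index.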
@philipvollet
Philip Vollet
2 years
I am looking for interns to help me build our Community at @explosion_ai - Become our next NLP & Machine Learning advocate - Build content and projects with our stack - Paid & full remote - We will slowly introduce you to the role and help you learn!
37
126
661
@philipvollet
Philip Vollet
3 years
Awesome Bioinformatics: a curated list of awesome bioinformatics software, resources, and libraries. Mostly command line based, and free or open-source.
5
165
637
@philipvollet
Philip Vollet
2 years
The Last Machine & Deep-Learning Compendium You'll Ever Need!
8
138
620
@philipvollet
Philip Vollet
3 years
Open Source Alternatives: 200+ open source alternatives to tools that businesses require in day-to-day operations
8
172
610
@philipvollet
Philip Vollet
2 years
tsai is an open-source deep learning package built on top of PyTorch & fastai, focused on state-of-the-art techniques for time series tasks like classification, regression, forecasting, imputation
8
105
603
@philipvollet
Philip Vollet
3 years
Kobra: a visual programming language for machine learning. Kobra is designed to help you learn machine learning without needing to learn how to code first.
12
147
608
@philipvollet
Philip Vollet
3 years
Insights from an open source influencer: I'm often asked how I get my content. Over the years I've built an unusual technology stack for it. Some insights:
31
95
606
@philipvollet
Philip Vollet
2 years
Manim is an animation engine for explanatory math videos. It's used to create precise animations programmatically, as demonstrated in the videos of 3Blue1Brown.
8
124
609
@philipvollet
Philip Vollet
2 years
RPA for Python: a package for doing Robotic Process Automation in Python $ pip install rpa
11
114
599
@philipvollet
Philip Vollet
2 years
D-Tale: your GUI for pandas DataFrames. It's like Excel for pandas! Your new tool for super-fast Exploratory Data Analysis (EDA) pip install dtale
5
120
582
@philipvollet
Philip Vollet
2 years
Parsr transforms PDF, documents, and images into enriched structured data
4
89
566
@philipvollet
Philip Vollet
3 years
darts: a Python library for easy manipulation and forecasting of time series. It contains a variety of models, from classics such as ARIMA to deep neural networks.
4
108
568
@philipvollet
Philip Vollet
3 years
FLAML (Fast and Lightweight AutoML) by Microsoft: a lightweight Python library that finds accurate machine learning models automatically, efficiently and economically.
4
120
544
@philipvollet
Philip Vollet
2 years
TorchDrug is a PyTorch-based machine learning toolbox designed for drug discovery!
2
127
535
@philipvollet
Philip Vollet
2 years
Pyinstrument: a Python profiler which helps you optimize your code and make it faster. To get the biggest speed increase you should focus on the slowest part of your program. Pyinstrument helps you find it! $ pip install pyinstrument
5
89
532
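To make the "find the slowest part first" idea concrete, here is a small sketch that uses the standard library's `cProfile` as a stand-in. It is not Pyinstrument itself: Pyinstrument is a statistical (sampling) profiler with its own API and output format, while `cProfile` traces every call. The functions `slow_part`, `fast_part`, and `program` are hypothetical examples.

```python
import cProfile
import io
import pstats


def slow_part():
    # Deliberately heavier loop so it dominates the profile.
    return sum(i * i for i in range(200_000))


def fast_part():
    return sum(range(100))


def program():
    fast_part()
    return slow_part()


# Profile one run of the program.
profiler = cProfile.Profile()
profiler.enable()
program()
profiler.disable()

# Sort by cumulative time so the most expensive call paths appear first,
# and keep only the top five rows.
buffer = io.StringIO()
stats = pstats.Stats(profiler, stream=buffer).sort_stats("cumulative")
stats.print_stats(5)
report = buffer.getvalue()
print(report)
```

Reading the report from the top down shows where the time goes; optimizing anything below the first few rows rarely changes total runtime much.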
@philipvollet
Philip Vollet
3 years
A clone version of GitHub Copilot. Instead of using AI, this extension sends your search query to Google, retrieves Stack Overflow answers and autocompletes them for you. @harishkgarg
11
124
528
@philipvollet
Philip Vollet
3 years
New release: VizTracer 0.14.0. Analyze your Python code and find bottlenecks and problems during the execution of your program
3
102
524
@philipvollet
Philip Vollet
2 years
Book lottery: Machine Learning in Biotechnology and Life Sciences: Build machine learning models using Python and deploy them on the cloud! Like and you're in the pool for one of three copies! @PacktPub
11
52
517
@philipvollet
Philip Vollet
3 years
GPT-J: the open source cousin of GPT-3. Everyone can use it! A 6 billion parameter, autoregressive text generation model trained on The Pile. @arankomatsuzaki Product Hunt GitHub The Pile dataset
7
137
512
@philipvollet
Philip Vollet
2 years
DocArray is a library for nested, unstructured data such as text, image, audio, video, 3D mesh. It allows deep learning engineers to efficiently process, embed, search, recommend, store, transfer the data with Pythonic API. @JinaAI_
4
95
464
@philipvollet
Philip Vollet
2 years
tldraw: a tiny little drawing app!
5
83
473
@philipvollet
Philip Vollet
2 years
Rich: a Python library for rich text and beautiful formatting in the terminal! Rich can also render pretty tables, progress bars, markdown, syntax highlighted source code, tracebacks, and more, out of the box. @willmcgugan
4
80
467
@philipvollet
Philip Vollet
2 years
geemap: a Python package for interactive mapping with Google Earth Engine, ipyleaflet, and ipywidgets.
4
74
459
@philipvollet
Philip Vollet
10 months
LLaMA2-Accessory: an open-source toolkit for pre-training, fine-tuning and deployment of Large Language Models and multimodal LLMs - Pre-training: RefinedWeb & StarCoder - Single-modal - Multi-modal fine-tuning - LLM for API Control
5
112
456
@philipvollet
Philip Vollet
2 years
Having trouble getting started with data annotation? Use bulk labeling to select clusters of text and annotate them 🤯 GitHub Bulk Binary annotation and a model in the loop Tutorial
11
89
457
@philipvollet
Philip Vollet
2 years
Edit your DataFrame like a spreadsheet! Mito is a Python package that lets you turn your data into an interactive spreadsheet. Each edit you make in Mito will generate the equivalent Python in the code cell below.
4
90
456
@philipvollet
Philip Vollet
2 years
Create super fast animated charts for your Jupyter Notebook with ipyvizzu pip install ipyvizzu Don't forget to star the repository!
9
75
449
@philipvollet
Philip Vollet
2 years
Ploomber is the fastest way to build data pipelines! Use your favorite editor: Jupyter, VSCode, PyCharm to develop interactively and deploy without code changes as Kubernetes, Airflow, AWS Batch, and SLURM pipelines. pip install ploomber
1
73
443
@philipvollet
Philip Vollet
2 years
Journalism AI – Quotes extraction for modular journalism - An NLP pipeline to extract quotes from news articles using NER, add coreferencing information and format the results for an exploratory search tool! GitHub Blog
5
79
438
@philipvollet
Philip Vollet
2 years
Google Research: Circuit Training: An open-source framework for generating chip floor plans with distributed deep reinforcement learning. Paper GitHub
3
90
435
@philipvollet
Philip Vollet
3 years
New release: Aim v3.0.0, an open-source, self-hosted AI experiment tracking tool. Use Aim to deeply inspect hundreds of hyperparameter-sensitive training runs at once. GitHub Blog Web
3
79
434
@philipvollet
Philip Vollet
2 years
ipygany: Jupyter into the third dimension - A new interactive widgets library that allows you to visualize and analyze volumetric data in your Jupyter Notebook!
3
91
434
@philipvollet
Philip Vollet
2 years
RPA for Python: a package for doing Robotic Process Automation in Python $ pip install rpa
2
90
422
@philipvollet
Philip Vollet
2 years
Graph Neural Networks for Novice Math Fanatics! @rishabh16_
1
75
415
@philipvollet
Philip Vollet
2 years
Manim is an animation engine for explanatory math videos. It's used to create precise animations programmatically, as demonstrated in the videos of 3Blue1Brown.
2
67
415
@philipvollet
Philip Vollet
3 years
How to deploy machine learning models as a microservice using FastAPI by @tiangolo Advantages of using FastAPI • Make your code components reusable • Highly maintained • Ease of testing • Quick in response time GitHub
@philipvollet
Philip Vollet
3 years
How do you create a beautiful interface for your machine learning or data science project? Handmade from scratch? Any good tools? Sure there are incredible tools:
17
78
365
1
107
410
@philipvollet
Philip Vollet
3 years
The Machine & Deep Learning Compendium Open Book. A comprehensive resource for data scientists & ML engineers. Medium Book Web GitHub
3
104
405
@philipvollet
Philip Vollet
2 years
Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems.
0
59
399
@philipvollet
Philip Vollet
3 years
Transformers Interpret is a model explainability tool designed to work exclusively with the @huggingface transformers package. Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
1
89
392
@philipvollet
Philip Vollet
3 years
SQLModel is a library for interacting with SQL databases from Python code, with Python objects. It is designed to be intuitive, easy to use, highly compatible & robust. SQLModel is based on Python type annotations, and powered by Pydantic and SQLAlchemy.
2
80
385
@philipvollet
Philip Vollet
2 years
Extracting information from PDFs or scanned documents is still a challenge! Use the @huggingface LayoutLMv3 model and Prodigy to tackle this challenge ✨ Blog GitHub
5
72
374
@philipvollet
Philip Vollet
2 years
NoiseCraft is an open source, browser-based visual programming language & platform for sound synthesis and music making that runs in your web browser @DrTBehrens
9
58
364
@philipvollet
Philip Vollet
3 years
How do you create a beautiful interface for your machine learning or data science project? Handmade from scratch? Any good tools? Sure there are incredible tools:
17
78
365
@philipvollet
Philip Vollet
2 years
txtai: build AI-powered semantic search applications! @DavidMezzetti Don't forget to star the repository!
6
77
352
@philipvollet
Philip Vollet
2 years
The original Transformer implemented in PyTorch @gordic_aleksa
2
55
349
@philipvollet
Philip Vollet
2 years
The Rise of Open Source Challengers! @rajko_rad Don't forget to spend some claps for the article!
2
93
351
@philipvollet
Philip Vollet
2 years
Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems.
2
56
348
@philipvollet
Philip Vollet
2 years
Pyinstrument: a Python profiler which helps you optimize your code and make it faster. To get the biggest speed increase you should focus on the slowest part of your program. Pyinstrument helps you find it! $ pip install pyinstrument
11
63
344
@philipvollet
Philip Vollet
3 years
TextDistance: Python library for comparing distance between two or more sequences by many algorithms.
2
52
341
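As a rough illustration of what "distance between sequences" means in the TextDistance tweet above, the sketch below implements two classic measures in plain Python. It does not use the TextDistance package or its API; `levenshtein` and `hamming` are names chosen here for illustration.

```python
def levenshtein(a, b):
    """Minimum number of single-item insertions, deletions, or
    substitutions needed to turn sequence a into sequence b."""
    # prev[j] is the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        curr = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            curr.append(min(
                prev[j] + 1,         # delete item_a
                curr[j - 1] + 1,     # insert item_b
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("hamming distance needs sequences of equal length")
    return sum(x != y for x, y in zip(a, b))


print(levenshtein("kitten", "sitting"))
print(hamming("karolin", "kathrin"))
```

Levenshtein tolerates insertions and deletions, so it suits strings of different lengths; Hamming only counts substitutions and is cheaper when the lengths already match.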
@philipvollet
Philip Vollet
2 years
DocArray is a library for nested, unstructured data such as text, image, audio, video, 3D mesh. It allows deep learning engineers to efficiently process, embed, search, recommend, store, transfer the data with Pythonic API. @JinaAI_
0
63
346
@philipvollet
Philip Vollet
2 years
Orchest lets you code, run and monitor data pipelines all from your browser! It's an Airflow alternative that's easier to use. Instead of configuring cloud infrastructure, simply hit the schedule button in Orchest.
3
67
340
@philipvollet
Philip Vollet
2 years
Faker is a Python package that generates fake data for you. Whether you need to bootstrap your database, create good-looking XML documents, fill-in your persistence to stress test it, or anonymize data taken from a production service, Faker is for you!
3
61
335
@philipvollet
Philip Vollet
2 years
FasterAI: Prune and Distill your models with FastAI and PyTorch! @HubensN
5
67
334
@philipvollet
Philip Vollet
8 months
Today we're releasing Verba, the Golden RAGtriever. It's completely open source, so you can bring your own data like internal knowledge base and documentation. Use Verba to build your own RAG (Retrieval Augmented Generation) pipeline and utilize LLMs for internal-based outputs…
23
71
339
@philipvollet
Philip Vollet
3 years
RPA for Python: a package for doing Robotic Process Automation in Python $ pip install rpa Features • Web automation • Visual automation • OCR automation • Keyboard automation • Mouse automation
3
69
334
@philipvollet
Philip Vollet
3 years
Unofficial PyTorch implementation of Fastformer. An efficient Transformer model based on additive attention.
6
47
328
@philipvollet
Philip Vollet
2 years
Deep Learning with PyTorch: Build, train, and tune neural networks using Python tools! Book GitHub
2
78
322
@philipvollet
Philip Vollet
3 years
Layout Parser • A Python Library for Document Layout Understanding GitHub Paper Dataset $ pip install layoutparser
2
69
324
@philipvollet
Philip Vollet
9 months
San Francisco is when your driver wrote software for the NASA Pathfinder mission in 1997 🔴🚀 At first I was a bit skeptical but he knew so much about Assembly, C and some already dead programming languages 🤣 what a legend I started in the 90s with Turbo Pascal and Basic pure…
20
21
317
@philipvollet
Philip Vollet
2 years
Panel: a high-level app and dashboarding solution for Python! Panel provides tools for easily composing widgets, plots, tables, and other viewable objects and controls into custom analysis tools, apps, and dashboards @MarcSkovMadsen @Panel_org
5
54
307
@philipvollet
Philip Vollet
3 years
Not enough training data for your NLP project? Data augmentation to the rescue! The standard in visual machine learning can also be used in natural language processing. But it works slightly differently
4
74
307
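A minimal sketch of what text augmentation can look like, assuming two simple token-level operations (random swap and random deletion). These are illustrative choices, not necessarily the techniques the linked material covers, and `random_swap` and `random_deletion` are hypothetical helper names.

```python
import random


def random_swap(sentence, n_swaps=1, seed=None):
    """Swap two randomly chosen word positions, n_swaps times."""
    rng = random.Random(seed)
    words = sentence.split()
    if len(words) < 2:
        return sentence
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(words)), 2)
        words[i], words[j] = words[j], words[i]
    return " ".join(words)


def random_deletion(sentence, p=0.2, seed=None):
    """Drop each word with probability p, always keeping at least one."""
    rng = random.Random(seed)
    words = sentence.split()
    if not words:
        return sentence
    kept = [w for w in words if rng.random() > p]
    return " ".join(kept) if kept else rng.choice(words)


original = "data augmentation can help small NLP datasets"
print(random_swap(original, n_swaps=2, seed=0))
print(random_deletion(original, p=0.3, seed=0))
```

Unlike flipping or cropping an image, these edits can change the meaning of a sentence, so augmented samples are usually kept close to the original (few swaps, low deletion probability) and spot-checked.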
@philipvollet
Philip Vollet
3 years
Implementation of ResMLP, an all MLP solution to image classification out of Facebook AI, in PyTorch $ pip install res-mlp-pytorch @lucidrains
5
45
299
@philipvollet
Philip Vollet
2 years
The Synthetic Data Vault (SDV) is a synthetic data generation ecosystem to easily learn single-table, multi-table, and time-series datasets to generate new synthetic data that has the same format and statistical properties as the original dataset.
5
64
302
@philipvollet
Philip Vollet
2 years
Cog: Containers for machine learning. An open source tool that lets you package machine learning models in a standard, production-ready container! - Docker containers without the pain - No more CUDA hell - Much more Don't forget to star the repository!
6
62
297
@philipvollet
Philip Vollet
3 years
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch $ pip install vit-pytorch GitHub Paper
1
62
293
@philipvollet
Philip Vollet
2 years
Mercury: easily convert Python notebook to web app and share with others!
1
71
291
@philipvollet
Philip Vollet
2 years
SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model. It connects optimal credit allocation with local explanations using the classic Shapley values from game theory.
4
51
289
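The "classic Shapley values from game theory" mentioned in the SHAP tweet above can be computed exactly for a tiny example. The sketch below is plain Python and does not use the SHAP package, which relies on its own API and on approximations for real models; `shapley_values` and `toy_value` are hypothetical names, and the payoff numbers are invented.

```python
from itertools import permutations


def shapley_values(players, value):
    """Exact Shapley values: each player's marginal contribution,
    averaged over every possible ordering of the players."""
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: total / len(orderings) for p, total in totals.items()}


def toy_value(coalition):
    """Invented payoff: 'a' adds 10, 'b' adds 20, together they add a
    bonus of 6, and 'c' contributes nothing."""
    score = 0.0
    if "a" in coalition:
        score += 10
    if "b" in coalition:
        score += 20
    if "a" in coalition and "b" in coalition:
        score += 6
    return score


phi = shapley_values(["a", "b", "c"], toy_value)
print(phi)
```

The interaction bonus is split evenly between `a` and `b`, and the values sum to the payoff of the full coalition. Enumerating every ordering grows factorially with the number of players, which is why practical tools approximate.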
@philipvollet
Philip Vollet
3 years
TorchIO: a medical image preprocessing and augmentation toolkit for deep learning in PyTorch. Efficiently read, preprocess, sample, augment, and write 3D medical images in deep learning applications.
5
68
283