EricZ's repos on GitHub
Python · 2928 watchers
datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Go · 108 watchers
lsh
Locality Sensitive Hashing for Go (Multi-probe LSH, LSH Forest, basic LSH)
Go · 62 watchers
lshensemble
LSH index for approximate set containment search
Go · 35 watchers
go-fasttext
Facebook fastText database in SQLite with Go API
Go · 26 watchers
go-sql-lsh
Locality Sensitive Hashing using Golang and SQL database
Go · 20 watchers
josie
Code and Benchmarks for JOSIE (SIGMOD 2019)
Go · 11 watchers
go-datasketch
Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)
JavaScript · 10 watchers
planning-poker
Planning Poker game for scrum team planning using Meteor.js
Go · 9 watchers
datatable
An in-memory relational table in Go similar to C#'s System.Data.DataTable.
Go · 4 watchers
counter
A frequency counter similar to Python's collections.Counter with additional support of other statistics.
Python · 3 watchers
rfc6266
Content-Disposition header support for Python
Python · 2 watchers
automl-gs
Provide an input CSV and a target field to predict, generate a model + code to run it.
Python · 2 watchers
gpt_index
GPT Index (LlamaIndex) is a project consisting of a set of data structures designed to make it easier to use large external knowledge bases with LLMs.
1 watchers
AgenticCookBook
The “Agentic Cookbook for Generative AI Agent usage” is a comprehensive guide designed to empower users with the knowledge and tools to effectively implement and utilize Generative AI Agents within their workflows.
1 watchers
autogen-ext-mcp
Turns Model Context Protocol server tools available in AutoGen >= v0.4
Python · 1 watchers
autogen-migration
Helpful scripts to migrate between autogen versions
1 watchers
awesome-llm-apps
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
1 watchers
FinRobot
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀
Jupyter Notebook · 1 watchers
FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
1 watchers
garnet
Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication features. Garnet can work with existing Redis clients.
Go · 1 watchers
go-minhash
BottomK minwise hashing for streaming set similarity
C++ · 1 watchers
mldb
MLDB is the Machine Learning Database
Python · 1 watchers
nserc-subjects
Use NSERC award application summaries to predict research subjects
Jupyter Notebook · 0 watchers
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
0 watchers
Autogen_GraphRAG_Ollama
Microsoft's GraphRAG + AutoGen + Ollama + Chainlit = Fully Local & Free Multi-Agent RAG Superbot
Jupyter Notebook · 0 watchers
big-ann-benchmarks
Framework for evaluating ANNS algorithms on billion scale datasets.
Java · 0 watchers
bigdata-interop
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
JavaScript · 0 watchers
binaryworm
A small web game inspired by a puzzle.
Go · 0 watchers
binsort
Binsort is a tool to sort files of fixed-length binary records
C · 0 watchers
bitarray
efficient arrays of booleans for Python
Python · 0 watchers
ckanapi
A command line interface and Python module for accessing the CKAN Action API
TeX · 0 watchers
csc373ta
Tutorial materials for CSC373
Rust · 0 watchers
differential-dataflow
An implementation of differential dataflow using timely dataflow on Rust.
0 watchers
DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
JavaScript · 0 watchers
drawsomesolver
A simple solver for Draw Something
Python · 0 watchers
dspy
DSPy: The framework for programming—not prompting—language models
0 watchers
floci
Light, fluffy, and always free - AWS Local Emulator
0 watchers
gitignore
A collection of useful .gitignore templates
Go · 0 watchers
go-mysql-server
An extensible MySQL server implementation in Go.
C · 0 watchers
go-sqlite3
sqlite3 driver for go that using database/sql
Python · 0 watchers
gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
0 watchers
h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
Python · 0 watchers
hgobequip
Equipment management panel for Hanggao Observatory
0 watchers
hnswlib
Header-only C++/python library for fast approximate nearest neighbors
CSS · 0 watchers
indepth
Astrophotography Gallery for Hanggao Observatory
0 watchers
joey
baby quokka
Go · 0 watchers
lane
A golang queues, stacks and deques implementation library
Python · 0 watchers
langchain
⚡ Building applications with LLMs through composability ⚡
JavaScript · 0 watchers
leon
🧠 Leon is your open-source personal assistant.
JavaScript · 0 watchers
luceneutil
Various utility scripts for running Lucene performance tests
Python · 0 watchers
messytables
Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py
0 watchers
msticpy
Microsoft Threat Intelligence Security Tools
C++ · 0 watchers
nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
0 watchers
openclaw
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
0 watchers
OptiGuide
Large Language Models for Supply Chain Optimization
0 watchers
paper-gpt
Paper utilities using LLM
Go · 0 watchers
pgfutter
Import CSV and JSON into PostgreSQL the easy way
Python · 0 watchers
pg_probackup
Backup and recovery manager for PostgreSQL
C · 0 watchers
pg_query_state
Tool for query progress monitoring in PostgreSQL
C · 0 watchers
pg_similarity
set of functions and operators for executing similarity queries
Shell · 0 watchers
postgres
Docker Official Image packaging for Postgres
Python · 0 watchers
promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
Go · 0 watchers
prototool
Your Swiss Army Knife for Protocol Buffers
Go · 0 watchers
pulumi
Define cloud apps and infrastructure in your favorite language and deploy to any cloud
Python · 0 watchers
pysparnn
Approximate Nearest Neighbor Search for Sparse Data in Python!
Python · 0 watchers
python-magic
A python wrapper for libmagic
TypeScript · 0 watchers
qwen-code
An open-source AI agent that lives in your terminal.
Python · 0 watchers
QwenPaw
Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.
0 watchers
rlm
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
Python · 0 watchers
sampleproject
A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"