Stars
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Golang database/sql driver for Databricks SQL.
small script to find Unifi cameras that come and go on the unifi website
DuckDB is an analytical in-process SQL database management system
An open protocol for secure data sharing
A concise grammar of interactive graphics, built on Vega.
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Homebridge plugin for PurpleAir, for monitoring air quality in Apple HomeKit as well as home automation based on air quality changes.
Collection of homebridge plugin examples
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
📊 The concise and progressive visualization grammar.
Highly-available version-controlled service configuration repository based on Git, ZooKeeper and HTTP/2
A high performance caching library for Java
Toolkit for testing multi-threaded and asynchronous applications
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
HandySpark - bringing pandas-like capabilities to Spark dataframes
Microsoft SEAL is an easy-to-use and powerful homomorphic encryption library.
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Easy TOC creation for GitHub README.md
by ex-googlers, for ex-googlers - a lookup table of similar tech & services