Integrate your chemometric tools with the scikit-learn API 🧪 🤖

Python 45 6 Updated Aug 11, 2024

This is a repo with links to everything you'd ever want to learn about data engineering

10,367 1,435 Updated Sep 11, 2024

A build-it-yourself AutoML Framework

Python 61 4 Updated Aug 14, 2024

📘 The experiment tracker for foundation model training

Python 574 63 Updated Sep 19, 2024

A game theoretic approach to explain the output of any machine learning model.

Jupyter Notebook 22,526 3,250 Updated Sep 18, 2024

Open Raman Processing Library

Python 28 4 Updated Jun 21, 2024

Configuration Management for Python ⚙

Python 3,716 287 Updated Sep 1, 2024

CLI tool to generate terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code

Go 12,440 1,630 Updated Sep 9, 2024

The R Installation Manager

Rust 623 21 Updated Sep 19, 2024

An extremely long review of R.

619 31 Updated Jul 25, 2023

Turns Data and AI algorithms into production-ready web applications in no time.

Python 11,949 833 Updated Sep 19, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,449 1,670 Updated Sep 19, 2024

A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

6,125 626 Updated Sep 10, 2024

Guidance for quality assurance of code for civil service researchers and analysts.

CSS 75 18 Updated Sep 16, 2024

The Infrastructure for Project rAPId

HCL 9 Updated Oct 19, 2023

The Score Specification provides a developer-centric and platform-agnostic Workload specification to improve developer productivity and experience. It eliminates configuration inconsistencies betwe…

Makefile 7,776 2,203 Updated Sep 12, 2024

A set of functions for cleaning and matching census data with Post Enumeration Survey data

Python 5 2 Updated Nov 20, 2023

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

Python 3,172 716 Updated Sep 11, 2024

Pulumi - Infrastructure as Code in any programming language 🚀

Go 20,993 1,092 Updated Sep 20, 2024

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 29,328 1,855 Updated Sep 19, 2024

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.

Jupyter Notebook 1,731 111 Updated Sep 19, 2024

A Python package template by NHS England that can be adapted for RAP projects.

Python 20 5 Updated Sep 11, 2024

High-velocity, monorepo-scale workflow for Git

Rust 3,432 85 Updated Sep 16, 2024

An engine to run your pipelines in containers

Go 10,912 588 Updated Sep 19, 2024

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

Python 863 38 Updated Jul 3, 2023

Doing dirty (but extremely useful) things with equals.

Python 798 36 Updated Sep 14, 2024

Test-Driven Data Analysis Functions

Python 293 29 Updated Aug 18, 2024

A CLI to build linked data cubes.

Python 12 1 Updated Aug 22, 2024

SQL databases in Python, designed for simplicity, compatibility, and robustness.

Python 14,105 624 Updated Sep 18, 2024

Functions for price index economics.

Python 5 1 Updated Mar 30, 2021