Data Science
Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
Here are 200 public repositories matching this topic...
Matplot++: A C++ Graphics Library for Data Visualization 📊🗾
Updated Oct 23, 2024 - C++
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba
Updated Nov 6, 2024 - C++
High-performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM), with Python and command-line interfaces.
Updated Aug 28, 2023 - C++
Shōgun
Updated Dec 19, 2023 - C++
C++ DataFrame for statistical, financial, and ML analysis in modern C++, using native types and contiguous memory storage
Updated Nov 9, 2024 - C++
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse
Updated Nov 7, 2024 - C++
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Updated Sep 16, 2022 - C++
The Universal Storage Engine
Updated Nov 8, 2024 - C++
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
Updated Nov 9, 2024 - C++
A library created to revitalize C++ as a machine learning front end. Per aspera ad astra.
Updated Feb 25, 2022 - C++
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Updated Aug 26, 2024 - C++
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.
Updated Nov 4, 2024 - C++
oneAPI Data Analytics Library (oneDAL)
Updated Nov 8, 2024 - C++
Combining tree-boosting with Gaussian process and mixed effects models
Updated Nov 4, 2024 - C++
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also supports running data cleaning scenarios with these algorithms. Desbordante has a console version and an easy-to-use web application.
Updated Nov 3, 2024 - C++
A Lean Persistent Homology Library for Python
Updated Oct 14, 2024 - C++
LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
Updated Nov 9, 2024 - C++
A visualisation tool for the creation and analysis of graphs
Updated Sep 11, 2024 - C++
Course notes, exams, and code examples from the Sakarya University Computer Engineering Department, 2017-2021.
Updated May 19, 2022 - C++
- 4.1k followers