Skip to content
View Laicheng0830's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Peng Cheng Laboratory
  • Shenzhen, China
  • 11:26 (UTC +08:00)

Organizations

@tensorlayer @openmlsys

Block or report Laicheng0830

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,092 3,831 Updated Sep 19, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,821 304 Updated Sep 20, 2024

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 467 50 Updated Jul 11, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,141 138 Updated Apr 20, 2024

Ongoing research training transformer models at scale

Python 10,037 2,260 Updated Sep 21, 2024

The respository of jec-qa.

Python 49 2 Updated Feb 2, 2020

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 19,020 2,632 Updated Sep 19, 2024

A high-performance inference system for large language models, designed for production environments.

C++ 373 26 Updated Sep 20, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,607 470 Updated Sep 19, 2024

Inference code for CodeLlama models

Python 15,879 1,845 Updated Aug 12, 2024

4 bits quantization of LLaMA using GPTQ

Python 2,983 457 Updated Jul 13, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,812 4,052 Updated Sep 21, 2024

OpenPose uses Pytorch for static quantization, saving, and loading of models

Python 77 12 Updated Jul 21, 2021

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

15,323 2,812 Updated Feb 1, 2024

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,583 403 Updated Sep 21, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,256 3,174 Updated Aug 12, 2024

compiler learning resources collect.

Python 2,062 323 Updated May 27, 2024

A multi-backend graph learning library.

Python 213 76 Updated Aug 19, 2024

functorch is JAX-like composable function transforms for PyTorch.

Jupyter Notebook 1,389 102 Updated Sep 21, 2024

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, …

Python 5,786 964 Updated May 29, 2024

ONNX Model Exporter for TensorLayerX

Python 21 1 Updated Sep 19, 2022

Making large AI models cheaper, faster and more accessible

Python 38,629 4,331 Updated Sep 19, 2024

Convert Machine Learning Code Between Frameworks

Python 14,021 5,771 Updated Sep 20, 2024

《C++模板元编程实战:一个深度学习框架的初步实现》

C++ 172 57 Updated Jun 16, 2019

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 3,961 430 Updated Apr 13, 2024

TensorLayerX: A Unified Deep Learning and Reinforcement Learning Framework for All Hardwares, Backends and OS.

Python 536 44 Updated Sep 8, 2024

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 674 112 Updated Oct 23, 2023

Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.

Python 1,162 246 Updated Aug 15, 2024
Next