UbeCc

Follow

Haoran Wang UbeCc

Follow

I am not a beast of burden. I am a LLaMA! 不是牛马是拉马（我不是奶龙）

34 followers · 113 following

Tsinghua University
Beijing, China
02:38 (UTC +08:00)
[email protected]
@UbecWang

Achievements

Achievements

Highlights

Pro

Pinned Loading

OpenRLHF/OpenRLHF OpenRLHF/OpenRLHF Public

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5.2k 512
Shape-Control-of-DLO Shape-Control-of-DLO Public

project for Deep Reinforcement Learning spring 24, Tsinghua Univ.

Python 4
Generalization-of-Transformers Generalization-of-Transformers Public

Code for paper "Generalization of Transformers with In-Context Learning: An Empirical Study"

Python