- Beijing, China
-
23:03
- 8h ahead - https://publish.obsidian.md/sherlock/Welcome
- @xingyu_liao
Lists (1)
Sort Name ascending (A-Z)
Stars
DeepEP: an efficient expert-parallel communication library
MoBA: Mixture of Block Attention for Long-Context LLMs
Official Repo for Open-Reasoner-Zero
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
A library for advanced large language model reasoning
verl: Volcano Engine Reinforcement Learning for LLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
Training Large Language Model to Reason in a Continuous Latent Space
Ring attention implementation with flash attention
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
The open-source visual AI programming environment and TypeScript library
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Samples demonstrating how to use the Compute Sanitizer Tools and Public API