🎯
Focusing
machine learning engineer - (data)scientist - reformed quant - habitual coder - PhD
Highlights
- Pro
Stars
Natural Language Processing
9 repositories
just a bunch of useful embeddings for scikit-learn pipelines
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Making text a first-class citizen in TensorFlow.
A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Adding guardrails to large language models.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.