Change the repository type filter
All
Repositories list
30 repositories
evalscope
PublicA streamlined and customizable framework for efficient large model evaluation and performance benchmarking- Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
DiffSynth-Studio
Publicdata-juicer
PublicData processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷FunASR
PublicA Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.- ModelScope: bring the notion of Model-as-a-Service to life.
agentscope
PublicStart building LLM-empowered multi-agent applications in an easier way.- Enjoy easier conversations with LLM
modelscope-studio
PublicA third-party component library based on Gradio.r-chain
PublicMemoryScope
Publicscepter
PublicSCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.ClearerVoice-Studio
PublicAn AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.dash-infer
PublicDashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
modelscope-agent
PublicModelScope-Agent: An agent framework connecting models in ModelScope with the worldfacechain
Publiccomfyscope
Public- [CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
motionagent
PublicFunClip
PublicOpen-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.lite-sora
Publicnormal-depth-diffusion
PublicFunCodec
PublicFunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
AdaSeq
PublicAdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models