ModelScope

All

30 repositories

evalscope
Public
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
performance evaluation vlm rag llm
Python
•
Apache License 2.0
•52•474•26•1•Updated Feb 25, 2025Feb 25, 2025
ms-swift
Public
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
agent deploy llama lora embedding liger peft multimodal sft distill
Python
•
Apache License 2.0
•498•5.8k•382•12•Updated Feb 25, 2025Feb 25, 2025
DiffSynth-Studio
Public
Enjoy the magic of Diffusion models!
Python
•
Apache License 2.0
•637•6.9k•133•2•Updated Feb 25, 2025Feb 25, 2025
data-juicer
Public
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
nlp data-science opendata data-visualization pytorch dataset chinese data-analysis llama gpt
Python
•
Apache License 2.0
•211•3.7k•28•12•Updated Feb 25, 2025Feb 25, 2025
FunASR
Public
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model
Python
•
Other
•869•8.4k•248•9•Updated Feb 25, 2025Feb 25, 2025
modelscope
Public
ModelScope: bring the notion of Model-as-a-Service to life.
nlp science cv speech multi-modal python machine-learning deep-learning
Python
•
Apache License 2.0
•763•7.4k•16•3•Updated Feb 24, 2025Feb 24, 2025
agentscope
Public
Start building LLM-empowered multi-agent applications in an easier way.
agent drag-and-drop chatbot multi-agent multi-modal distributed-agents gpt-4 large-language-models llm llm-agent
Python
•
Apache License 2.0
•372•6.3k•32•19•Updated Feb 24, 2025Feb 24, 2025
awesome-deep-reasoning
Public
Collect every awesome work about r1!
collection rl reasoning r1 o1 qwen deepseek grpo
Python
•5•207•0•0•Updated Feb 24, 2025Feb 24, 2025
PromptScope
Public
Enjoy easier conversations with LLM
prompt multi-modal gpt-4 in-context-learning large-language-models prompt-engineering llms
Python
•
Apache License 2.0
•2•24•2•0•Updated Feb 24, 2025Feb 24, 2025
modelscope-studio
Public
A third-party component library based on Gradio.
python ui gradio antd-design modelscope gradio-custom-component modelscope-studio
Python
•
Apache License 2.0
•10•82•4•0•Updated Feb 21, 2025Feb 21, 2025
r-chain
Public
Python
•
Apache License 2.0
•1•5•0•0•Updated Feb 18, 2025Feb 18, 2025
MemoryScope
Public
Python
•
Apache License 2.0
•40•400•6•0•Updated Feb 17, 2025Feb 17, 2025
scepter
Public
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
generative-model scedit aigc lar-gen stylebooth
Python
•
Apache License 2.0
•27•465•10•0•Updated Feb 17, 2025Feb 17, 2025
ClearerVoice-Studio
Public
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
audio deep-learning speech pytorch speech-separation speech-enhancement noise-suppression speaker-extraction bandwidth-extension speech-super-resolution
Python
•
Apache License 2.0
•168•2.3k•23•3•Updated Feb 14, 2025Feb 14, 2025
dash-infer
Public
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
cpu cuda llm llm-inference native-engine guided-decoding
C
•
Apache License 2.0
•24•234•10•0•Updated Feb 13, 2025Feb 13, 2025
3D-Speaker
Public
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker cnceleb sdpn
Python
•
Apache License 2.0
•140•1.7k•0•0•Updated Jan 17, 2025Jan 17, 2025
modelscope-agent
Public
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agent data-science code chatbot android-application multi-agents rag mobile-agents gpts llm
Python
•
Apache License 2.0
•334•2.9k•70•3•Updated Jan 8, 2025Jan 8, 2025
modelscope-classroom
Public
Jupyter Notebook
•
Apache License 2.0
•81•716•1•0•Updated Dec 31, 2024Dec 31, 2024
langchain-modelscope
Public
Langchain integration for ModelScope
Python
•
Apache License 2.0
•1•4•0•0•Updated Dec 27, 2024Dec 27, 2024
facechain
Public
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Jupyter Notebook
•
Apache License 2.0
•868•9.3k•12•2•Updated Dec 10, 2024Dec 10, 2024
comfyscope
Public
Collection of various Comfy components.
Python
•
Apache License 2.0
•1•3•0•2•Updated Nov 20, 2024Nov 20, 2024
richdreamer
Public
[CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo：https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Python
•
Apache License 2.0
•20•447•16•0•Updated Sep 27, 2024Sep 27, 2024
motionagent
Public
MotionAgent is your AI assistent to convert ideas into motion pictures.
Python
•
Apache License 2.0
•36•291•3•1•Updated Sep 2, 2024Sep 2, 2024
FunClip
Public
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm
Python
•
MIT License
•470•4.2k•30•2•Updated Aug 22, 2024Aug 22, 2024
lite-sora
Public
An initiative to replicate Sora
Python
•
Apache License 2.0
•7•103•3•0•Updated Apr 10, 2024Apr 10, 2024
normal-depth-diffusion
Public
Python
•
Apache License 2.0
•9•127•6•0•Updated Feb 7, 2024Feb 7, 2024
FunCodec
Public
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
tts speech-synthesis codec speech-to-text audio-generation encodec voicecloning audio-quantization
Python
•
MIT License
•32•387•20•1•Updated Jan 25, 2024Jan 25, 2024
KAN-TTS
Public
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope speech tts speech-synthesis
Python
•
MIT License
•85•500•42•1•Updated Dec 28, 2023Dec 28, 2023
AdaSeq
Public
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
natural-language-processing information-extraction chinese-nlp word-segmentation bert sequence-labeling relation-extraction natural-language-understanding entity-typing token-classification
Python
•
Apache License 2.0
•41•430•32•0•Updated Nov 15, 2023Nov 15, 2023
kws-training-suite
Public
Python
•
MIT License
•20•101•7•0•Updated May 26, 2023May 26, 2023