魔搭社区模型速递（4.20-4.26）

魔搭ModelScope社区

186人浏览 · 2025-04-28 10:51:15

魔搭ModelScope社区 · 2025-04-28 10:51:15 发布

🙋魔搭ModelScope本期社区进展：

📟2227个模型：Bitnet-b1.58-2B-4T、MAGI-1、Dia-1.6B、SkyReels-V2系列等；

📁270个数据集：OpenMathReasoning、xtreme、InternVL-Data等；

🎨167个创新应用：高度逼真的对话声音合成Dia-1.6B、hidream-arena、ChatTests编题等；

📄 10篇内容：

Ollama-Deep-Researcher-本地Mac结合魔搭社区模型搭建网页研究助手
答疑机器人实践：AgentScope多智能体带你玩转多源召回
微软推出bitnet-b1.58-2B-4T：极致的量化，小巧而强大
智源数据新基建系列 Workshop（第1期）来了！从大模型到具身智能，线上线下同步！
ImagePulse图律脉动数据集开源发布：解码GPT-4o级图像生成能力，四大原子数据集+自动生成工具开放
Prot2Chat：融合蛋白质序列与结构的新型问答框架
“一丹一世界”三等奖 | 绮影录创作分享
“一丹一世界”二等奖 | MAJICFLUS_新浮世创作分享
"一丹一世界"一等奖 | 曙光_麦橘超然创作分享
"一丹一世界"二等奖 | TPSZ_二次元卡通梦幻插画风格-童梦拾光_majiuFlux版本创作分享

01.模型推荐

Bitnet-b1.58-2B-4T

BitNet-b1.58-2B-4T 是微软开源的一种创新量化模型，采用1.58位的三进制量化方案（权重为+1、0、-1），旨在通过大幅降低计算成本和内存占用，提升大语言模型在端侧等资源受限场景下的应用效率，同时缓解传统量化方法中因精度降低而导致的准确率下降问题。

模型地址

https://modelscope.cn/models/AI-ModelScope/bitnet-b1.58-2B-4T

示例代码

使用transformers进行推理：

环境安装：

!pip install git+https://github.com/shumingma/transformers.git

示例代码


import torch
from modelscope import AutoModelForCausalLM, AutoTokenizer
model_id = "AI-ModelScope/bitnet-b1.58-2B-4T"
# Load tokenizer and model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16
)
# Apply the chat template

更多推理实战

微软推出bitnet-b1.58-2B-4T：极致的量化，小巧而强大

MAGI-1

MAGI-1是由Sand.ai团队开源的一个世界模型，它通过自回归预测一系列视频块（定义为固定长度的连续帧片段）来生成视频。MAGI-1 经过训练，可以去除随时间单调递增的单块噪声，从而实现因果时间建模，并自然支持流式生成。它在以文本指令为条件的图像到视频 (I2V) 任务中表现出色，提供了高度的时间一致性和可扩展性，这得益于多项算法创新和专用的基础架构堆栈。MAGI-1 还通过逐块提示支持可控生成，从而实现流畅的场景过渡、长时域合成和细粒度的文本驱动控制。我们相信，MAGI-1 为统一高保真视频生成、灵活的指令控制和实时部署提供了一个充满希望的方向。

模型地址

https://modelscope.cn/models/sand-ai/MAGI-1

示例代码

使用 Docker 环境运行（推荐）

docker pull sandai/magi:latest
docker run -it --gpus all --privileged --shm-size=32g --name magi --net=host --ipc=host --ulimit memlock=-1 --ulimit stack=6710886 sandai/magi:latest /bin/bash

使用源代码运行

# Create a new environment
conda create -n magi python==3.10.12
# Install pytorch
conda install pytorch==2.4.0 torchvision==0.19.0 torchaudio==2.4.0 pytorch-cuda=12.4 -c pytorch -c nvidia
# Install other dependencies
pip install -r requirements.txt
# Install ffmpeg
conda install -c conda-forge ffmpeg=4.4
# Install MagiAttention, for more information, please refer to https://github.com/SandAI-org/MagiAttention#
git clone git@github.com:SandAI-org/MagiAttention.git
cd MagiAttention
git submodule update --init --recursive
pip install --no-build-isolation .

SkyReels-V2系列

SkyReels-V2是昆仑万维SkyReels团队推出全球首创扩散强迫框架的无限时长电影生成模型，通过融合多模态大语言模型、多阶段预训练与强化学习技术实现联合优化，基于Diffusion Forcing架构，支持文本到视频、图像到视频的高质量生成，适用于故事创作、广告制作和影视特效等多种场景。本次开源了1.3B、5B、13B多个尺寸版本。

模型合集

https://modelscope.cn/collections/SkyReels-V2-f665650130b144

示例代码

10秒视频同步生成


model_id=Skywork/SkyReels-V2-DF-14B-540P
# synchronous inference
python3 generate_video_df.py \
  --model_id ${model_id} \
  --resolution 540P \
  --ar_step 0 \
  --base_num_frames 97 \
  --num_frames 257 \
  --overlap_history 17 \
  --prompt "A graceful white swan with a curved neck and delicate feathers swimming in a serene lake at dawn, its reflection perfectly mirrored in the still water as mist rises from the surface, with the swan occasionally dipping its head into the water to feed." \
  --addnoise_condition 20 \
  --offload \
  --teacache \
  --use_ret_steps \
  --teacache_thresh 0.3

30秒视频异步生成


model_id=Skywork/SkyReels-V2-DF-14B-540P
# asynchronous inference
python3 generate_video_df.py \
  --model_id ${model_id} \
  --resolution 540P \
  --ar_step 5 \
  --causal_block_size 5 \
  --base_num_frames 97 \
  --num_frames 737 \
  --overlap_history 17 \
  --prompt "A graceful white swan with a curved neck and delicate feathers swimming in a serene lake at dawn, its reflection perfectly mirrored in the still water as mist rises from the surface, with the swan occasionally dipping its head into the water to feed." \
  --addnoise_condition 20 \
  --offload