Skip to content

Commit 4a0308b

Browse files
authored
Uploaded videos for DJ-features quick view (#738)
* dj quick view videos * dj quick view videos * rollback error fig links
1 parent b3776ae commit 4a0308b

File tree

2 files changed

+19
-3
lines changed

2 files changed

+19
-3
lines changed

README.md

Lines changed: 10 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44
# Data Processing for and with Foundation Models
55

66
<img src="https://img.alicdn.com/imgextra/i1/O1CN01fUfM5A1vPclzPQ6VI_!!6000000006165-0-tps-1792-1024.jpg" width = "533" height = "300" alt="Data-Juicer"/>
7-
7+
88
![](https://img.shields.io/badge/language-Python-214870.svg)
99
![](https://img.shields.io/badge/license-Apache--2.0-000000.svg)
1010
[![pypi version](https://img.shields.io/pypi/v/py-data-juicer?logo=pypi&color=026cad)](https://pypi.org/project/py-data-juicer)
@@ -36,6 +36,14 @@ We provide a [playground](http://8.138.149.181/) with a managed JupyterLab. [Try
3636

3737
Data-Juicer is being actively updated and maintained. We will periodically enhance and add more features, data recipes and datasets. We welcome you to join us (via issues, PRs, [Slack](https://join.slack.com/t/data-juicer/shared_invite/zt-23zxltg9d-Z4d3EJuhZbCLGwtnLWWUDg?spm=a2c22.12281976.0.0.7a8253f30mgpjw) channel, [DingDing](https://qr.dingtalk.com/action/joingroup?code=v1,k1,YFIXM2leDEk7gJP5aMC95AfYT+Oo/EP/ihnaIEhMyJM=&_dt_no_comment=1&origin=11) group, ...), in promoting data-model co-development along with research and applications of foundation models!
3838

39+
[Demo Video] DataJuicer-Agent: Quick start your data processing journey!
40+
41+
https://github.com/user-attachments/assets/58aea900-e51f-4ec2-b1c0-eead97967893
42+
43+
[Demo Video] DataJuicer-Sandbox: Better data-model co-dev at a lower cost!
44+
45+
https://github.com/user-attachments/assets/a45f0eee-0f0e-4ffe-9a42-d9a55370089d
46+
3947

4048
## News
4149
- 🛠️ [2025-06-04] How to process feedback data in the "era of experience"? We propose [Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of LLMs](https://arxiv.org/abs/2505.17826), which leverages Data-Juicer for its data pipelines tailored for RFT scenarios.
@@ -76,7 +84,7 @@ Besides, our paper is also updated to [v3](https://arxiv.org/abs/2309.02033).
7684

7785
## Why Data-Juicer?
7886

79-
<img src="https://img.alicdn.com/imgextra/i2/O1CN01EteoQ31taUweAW1UE_!!6000000005918-2-tps-4034-4146.png" align="center" width="600" />
87+
<img src="https://img.alicdn.com/imgextra/i4/O1CN015URK6i21KU3XdkUpK_!!6000000006966-2-tps-3994-3956.png" align="center" width="500" />
8088

8189
- **Systematic & Reusable**:
8290
Empowering users with a systematic library of 100+ core [OPs](docs/Operators.md), and 50+ reusable config recipes and

README_ZH.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,6 +30,14 @@ Data-Juicer 是一个一站式系统,面向大模型的文本及多模态数
3030

3131
Data-Juicer正在积极更新和维护中,我们将定期强化和新增更多的功能和数据菜谱。热烈欢迎您加入我们(issues/PRs/[Slack频道](https://join.slack.com/t/data-juicer/shared_invite/zt-23zxltg9d-Z4d3EJuhZbCLGwtnLWWUDg?spm=a2c22.12281976.0.0.7a8275bc8g7ypp) /[钉钉群](https://qr.dingtalk.com/action/joingroup?code=v1,k1,YFIXM2leDEk7gJP5aMC95AfYT+Oo/EP/ihnaIEhMyJM=&_dt_no_comment=1&origin=11)/...),一起推进大模型的数据-模型协同开发和研究应用!
3232

33+
[Demo Video] DataJuicer-Agent:数据处理,即刻启程!
34+
35+
https://github.com/user-attachments/assets/58aea900-e51f-4ec2-b1c0-eead97967893
36+
37+
[Demo Video] DataJuicer-Sandbox: 降本增效,优化数据-模型协同开发!
38+
39+
https://github.com/user-attachments/assets/a45f0eee-0f0e-4ffe-9a42-d9a55370089d
40+
3341

3442
----
3543

@@ -75,7 +83,7 @@ Data-Juicer正在积极更新和维护中,我们将定期强化和新增更多
7583

7684
## 为什么选择 Data-Juicer?
7785

78-
<img src="https://img.alicdn.com/imgextra/i2/O1CN01EteoQ31taUweAW1UE_!!6000000005918-2-tps-4034-4146.png" align="center" width="600" />
86+
<img src="https://img.alicdn.com/imgextra/i4/O1CN015URK6i21KU3XdkUpK_!!6000000006966-2-tps-3994-3956.png" align="center" width="500" />
7987

8088
- **系统化和可重用**
8189
系统化地为用户提供 100 多个核心 [算子](docs/Operators.md) 和 50 多个可重用的数据菜谱和

0 commit comments

Comments
 (0)