DCT-Net/README.md

# DCT-Net: Domain-Calibrated Translation for Portrait Stylization

### [Project page](https://menyifang.github.io/projects/DCTNet/DCTNet.html) |  [Video](https://www.youtube.com/watch?v=Y8BrfOjXYQM) | [Paper](https://arxiv.org/abs/2207.02426)

Official implementation of DCT-Net for Full-body Portrait Stylization.


> [**DCT-Net: Domain-Calibrated Translation for Portrait Stylization**](arxiv_url_coming_soon),             
> [Yifang Men](https://menyifang.github.io/)<sup>1</sup>, Yuan Yao<sup>1</sup>, Miaomiao Cui<sup>1</sup>, [Zhouhui Lian](https://www.icst.pku.edu.cn/zlian/)<sup>2</sup>, Xuansong Xie<sup>1</sup>,        
> _<sup>1</sup>[DAMO Academy, Alibaba Group](https://damo.alibaba.com), Beijing, China_  
> _<sup>2</sup>[Wangxuan Institute of Computer Technology, Peking University](https://www.icst.pku.edu.cn/), China_     
> In: SIGGRAPH 2022 (**TOG**) 
> *[arXiv preprint](https://arxiv.org/abs/2207.02426)* 

<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a> 
[![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/SIGGRAPH2022/DCT-Net)


## Demo
![demo](assets/demo.gif)


## News

(2023-03-14) The training guidance has been released, train DCT-Net with your own style data.

(2023-02-20) Two new style pre-trained models (design, illustration) trained with combined DCT-Net and Stable-Diffusion are provided. The training guidance will be released soon.

(2022-10-09) The multi-style pre-trained models (3d, handdrawn, sketch, artstyle) and usage are available now. 

(2022-08-08) The pertained model and infer code of 'anime' style is available now. More styles coming soon.

(2022-08-08) cartoon function can be directly call from pythonSDK.

(2022-07-07) The paper is available now at arxiv(https://arxiv.org/abs/2207.02426).


## Web Demo
- Integrated into [Colab notebook](https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb). Try out the colab demo.<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a> 

- Integrated into [Huggingface Spaces 🤗](https://huggingface.co/spaces) using [Gradio](https://github.com/gradio-app/gradio). Try out the Web Demo [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/SIGGRAPH2022/DCT-Net)

- [Chinese version] Integrated into [ModelScope](https://modelscope.cn/#/models). Try out the Web Demo [![ModelScope Spaces](
https://img.shields.io/badge/ModelScope-Spaces-blue)](https://modelscope.cn/#/models/damo/cv_unet_person-image-cartoon_compound-models/summary)

## Requirements
* python 3
* tensorflow (>=1.14, training only support tf1.x)
* easydict
* numpy
* both CPU/GPU are supported


## Quick Start
<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a> 


```bash
git clone https://github.com/menyifang/DCT-Net.git
cd DCT-Net

```

### Installation
```bash
conda create -n dctnet python=3.7
conda activate dctnet
pip install --upgrade tensorflow-gpu==1.15 # GPU support, use tensorflow for CPU only
pip install "modelscope[cv]==1.3.2" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
pip install "modelscope[multi-modal]==1.3.2" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
```

### Downloads

| [<img src="assets/sim_anime.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon_compound-models/summary) | [<img src="assets/sim_3d.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-3d_compound-models/summary) | [<img src="assets/sim_handdrawn.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-handdrawn_compound-models/summary)| [<img src="assets/sim_sketch.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sketch_compound-models/summary)| [<img src="assets/sim_artstyle.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-artstyle_compound-models/summary)|
|:--:|:--:|:--:|:--:|:--:| 
| [anime](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon_compound-models/summary) | [3d](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-3d_compound-models/summary) | [handdrawn](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-handdrawn_compound-models/summary) | [sketch](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sketch_compound-models/summary) | [artstyle](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-artstyle_compound-models/summary) | 

| [<img src="assets/sim_design.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-design_compound-models/summary) | [<img src="assets/sim_illu.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-illustration_compound-models/summary) |
|:--:|:--:| 
| [design](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-design_compound-models/summary) | [illustration](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-illustration_compound-models/summary)

Pre-trained models in different styles can be downloaded by
```bash
python download.py
```

### Inference

- from python SDK
```bash
python run_sdk.py
```

- from source code
```bash
python run.py
```

### Video cartoonization

![demo_vid](assets/video.gif)

video can be directly processed as image sequences, style choice [option: anime, 3d, handdrawn, sketch, artstyle, sd-design, sd-illustration]

```bash
python run_vid.py --style anime
```


## Training
<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/fastTrain.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a> 

### Data preparation
```
face_photo: face dataset such as [FFHQ](https://github.com/NVlabs/ffhq-dataset) or other collected real faces.
face_cartoon: 100-300 cartoon face images in a specific style, which can be self-collected or synthsized with generative models.
```
Due to the copyrighe issues, we can not provide collected cartoon exemplar for training. You can produce cartoon exemplars with the style-finetuned Stable-Diffusion (SD) models, which can be downloaded from modelscope or huggingface hubs.

The effects of some style-finetune SD models are as follows:

| [<img src="assets/sim1.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_design/summary) | [<img src="assets/sim2.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_watercolor) | [<img src="assets/sim3.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_illustration/summary)| [<img src="assets/sim4.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_clipart/summary)| [<img src="assets/sim5.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_flat/summary)|
|:--:|:--:|:--:|:--:|:--:| 
| [design](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_design/summary) | [watercolor](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_watercolor/summary) | [illustration](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_illustration/summary) | [clipart](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_clipart/summary) | [flat](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_flat/summary) | 

- Generate stylized data, style choice [option: clipart, design, illustration, watercolor, flat]
```bash
python generate_data.py --style clipart
```

- preprocess

extract aligned faces from raw style images:
```bash
python extract_align_faces.py --src_dir 'data/raw_style_data'
```

- train content calibration network 

install environment required by [stylegan2-pytorch](https://github.com/rosinality/stylegan2-pytorch)
```bash
cd source/stylegan2
python prepare_data.py '../../data/face_cartoon' --size 256 --out '../../data/stylegan2/traindata'
python train_condition.py --name 'ffhq_style_s256' --path '../../data/stylegan2/traindata' --config config/conf_server_train_condition_shell.json
```

after training, generated content calibrated samples via:
```bash
python style_blend.py --name 'ffhq_style_s256'
python generate_blendmodel.py --name 'ffhq_style_s256' --save_dir '../../data/face_cartoon/syn_style_faces'
```

- geometry calibration

run geometry calibration for both photo and cartoon:
```bash
cd source
python image_flip_agument_parallel.py --data_dir '../data/face_cartoon'
python image_scale_agument_parallel_flat.py --data_dir '../data/face_cartoon'
python image_rotation_agument_parallel_flat.py --data_dir '../data/face_cartoon'
```

- train texture translator

The dataset structure is recommended as:
```
+—data
|   +—face_photo
|   +—face_cartoon
```
resume training from pretrained model in similar style,

style can be chosen from 'anime, 3d, handdrawn, sketch, artstyle, sd-design, sd-illustration'

```bash
python train_localtoon.py --data_dir PATH_TO_YOU_DATA --work_dir PATH_SAVE --style anime
```


## Acknowledgments

Face detector and aligner are adapted from [Peppa_Pig_Face_Engine](https://github.com/610265158/Peppa_Pig_Face_Engine
) and [InsightFace](https://github.com/TreB1eN/InsightFace_Pytorch).


## Citation

If you find this code useful for your research, please use the following BibTeX entry.

```bibtex
@inproceedings{men2022dct,
  title={DCT-Net: Domain-Calibrated Translation for Portrait Stylization},
  author={Men, Yifang and Yao, Yuan and Cui, Miaomiao and Lian, Zhouhui and Xie, Xuansong},
  journal={ACM Transactions on Graphics (TOG)},
  volume={41},
  number={4},
  pages={1--9},
  year={2022},
  publisher={ACM New York, NY, USA}
}
```
Update README.md update 2 years ago			`# DCT-Net: Domain-Calibrated Translation for Portrait Stylization`
uodate 3 years ago
Update README.md 3 years ago			`### [Project page](https://menyifang.github.io/projects/DCTNet/DCTNet.html) \| [Video](https://www.youtube.com/watch?v=Y8BrfOjXYQM) \| [Paper](https://arxiv.org/abs/2207.02426)`
uodate 3 years ago
Update README.md update 2 years ago			`Official implementation of DCT-Net for Full-body Portrait Stylization.`
uodate 3 years ago

Update README.md 3 years ago			`> [DCT-Net: Domain-Calibrated Translation for Portrait Stylization](arxiv_url_coming_soon),`
			`> [Yifang Men](https://menyifang.github.io/)<sup>1</sup>, Yuan Yao<sup>1</sup>, Miaomiao Cui<sup>1</sup>, [Zhouhui Lian](https://www.icst.pku.edu.cn/zlian/)<sup>2</sup>, Xuansong Xie<sup>1</sup>,`
Update README.md 3 years ago			`> _<sup>1</sup>[DAMO Academy, Alibaba Group](https://damo.alibaba.com), Beijing, China_`
Update README.md 3 years ago			`> _<sup>2</sup>[Wangxuan Institute of Computer Technology, Peking University](https://www.icst.pku.edu.cn/), China_`
Update README.md 3 years ago			`> In: SIGGRAPH 2022 (TOG)`
Update README.md 3 years ago			`> [arXiv preprint](https://arxiv.org/abs/2207.02426)`
uodate 3 years ago
update 2 years ago			`<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a>`
			`[![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/SIGGRAPH2022/DCT-Net)`

uodate 3 years ago
			`## Demo`
update training code 2 years ago			`![demo](assets/demo.gif)`


			`## News`

			`(2023-03-14) The training guidance has been released, train DCT-Net with your own style data.`

			`(2023-02-20) Two new style pre-trained models (design, illustration) trained with combined DCT-Net and Stable-Diffusion are provided. The training guidance will be released soon.`

			`(2022-10-09) The multi-style pre-trained models (3d, handdrawn, sketch, artstyle) and usage are available now.`

			`(2022-08-08) The pertained model and infer code of 'anime' style is available now. More styles coming soon.`

			`(2022-08-08) cartoon function can be directly call from pythonSDK.`

			`(2022-07-07) The paper is available now at arxiv(https://arxiv.org/abs/2207.02426).`


			`## Web Demo`
			`- Integrated into [Colab notebook](https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb). Try out the colab demo.<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a>`

			`- Integrated into [Huggingface Spaces 🤗](https://huggingface.co/spaces) using [Gradio](https://github.com/gradio-app/gradio). Try out the Web Demo [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/SIGGRAPH2022/DCT-Net)`

			`- [Chinese version] Integrated into [ModelScope](https://modelscope.cn/#/models). Try out the Web Demo [![ModelScope Spaces](`
			`https://img.shields.io/badge/ModelScope-Spaces-blue)](https://modelscope.cn/#/models/damo/cv_unet_person-image-cartoon_compound-models/summary)`

			`## Requirements`
			`* python 3`
			`* tensorflow (>=1.14, training only support tf1.x)`
			`* easydict`
			`* numpy`
			`* both CPU/GPU are supported`


			`## Quick Start`
			`<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/inference.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a>`


			```bash
			`git clone https://github.com/menyifang/DCT-Net.git`
			`cd DCT-Net`

			```

			`### Installation`
			```bash
			`conda create -n dctnet python=3.7`
			`conda activate dctnet`
			`pip install --upgrade tensorflow-gpu==1.15 # GPU support, use tensorflow for CPU only`
			`pip install "modelscope[cv]==1.3.2" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html`
			`pip install "modelscope[multi-modal]==1.3.2" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html`
			```

			`### Downloads`

			\| [<img src="assets/sim_anime.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon_compound-models/summary) \| [<img src="assets/sim_3d.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-3d_compound-models/summary) \| [<img src="assets/sim_handdrawn.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-handdrawn_compound-models/summary)\| [<img src="assets/sim_sketch.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sketch_compound-models/summary)\| [<img src="assets/sim_artstyle.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-artstyle_compound-models/summary)\|
			`\|:--:\|:--:\|:--:\|:--:\|:--:\|`
			\| [anime](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon_compound-models/summary) \| [3d](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-3d_compound-models/summary) \| [handdrawn](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-handdrawn_compound-models/summary) \| [sketch](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sketch_compound-models/summary) \| [artstyle](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-artstyle_compound-models/summary) \|

			`\| [<img src="assets/sim_design.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-design_compound-models/summary) \| [<img src="assets/sim_illu.png" width="200px">](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-illustration_compound-models/summary) \|`
			`\|:--:\|:--:\|`
			`\| [design](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-design_compound-models/summary) \| [illustration](https://modelscope.cn/models/damo/cv_unet_person-image-cartoon-sd-illustration_compound-models/summary)`

			`Pre-trained models in different styles can be downloaded by`
			```bash
			`python download.py`
			```

			`### Inference`

			`- from python SDK`
			```bash
			`python run_sdk.py`
			```

			`- from source code`
			```bash
			`python run.py`
			```

			`### Video cartoonization`

			`![demo_vid](assets/video.gif)`

			`video can be directly processed as image sequences, style choice [option: anime, 3d, handdrawn, sketch, artstyle, sd-design, sd-illustration]`

			```bash
			`python run_vid.py --style anime`
			```


update 2 years ago			`## Training`
update training code 2 years ago			`<a href="https://colab.research.google.com/github/menyifang/DCT-Net/blob/main/notebooks/fastTrain.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="google colab logo"></a>`
update multi-style models 2 years ago
update 2 years ago			`### Data preparation`
update multi-style models 2 years ago			```
update 2 years ago			`face_photo: face dataset such as [FFHQ](https://github.com/NVlabs/ffhq-dataset) or other collected real faces.`
			`face_cartoon: 100-300 cartoon face images in a specific style, which can be self-collected or synthsized with generative models.`
update multi-style models 2 years ago			```
update 2 years ago			`Due to the copyrighe issues, we can not provide collected cartoon exemplar for training. You can produce cartoon exemplars with the style-finetuned Stable-Diffusion (SD) models, which can be downloaded from modelscope or huggingface hubs.`
update multi-style models 2 years ago
update 2 years ago			`The effects of some style-finetune SD models are as follows:`
update multi-style models 2 years ago
update 2 years ago			\| [<img src="assets/sim1.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_design/summary) \| [<img src="assets/sim2.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_watercolor) \| [<img src="assets/sim3.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_illustration/summary)\| [<img src="assets/sim4.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_clipart/summary)\| [<img src="assets/sim5.png" width="240px">](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_flat/summary)\|
update 2 years ago			`\|:--:\|:--:\|:--:\|:--:\|:--:\|`
			`\| [design](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_design/summary) \| [watercolor](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_watercolor/summary) \| [illustration](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_illustration/summary) \| [clipart](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_clipart/summary) \| [flat](https://modelscope.cn/models/damo/cv_cartoon_stable_diffusion_flat/summary) \|`
update multi-style models 2 years ago
update 2 years ago			`- Generate stylized data, style choice [option: clipart, design, illustration, watercolor, flat]`
update multi-style models 2 years ago			```bash
update 2 years ago			`python generate_data.py --style clipart`
update multi-style models 2 years ago			```

update training code 2 years ago			`- preprocess`
update 2 years ago
update training code 2 years ago			`extract aligned faces from raw style images:`
			```bash
			`python extract_align_faces.py --src_dir 'data/raw_style_data'`
			```
update 2 years ago
update training code 2 years ago			`- train content calibration network`
update 2 years ago
update 2 years ago			`install environment required by [stylegan2-pytorch](https://github.com/rosinality/stylegan2-pytorch)`
update training code 2 years ago			```bash
			`cd source/stylegan2`
			`python prepare_data.py '../../data/face_cartoon' --size 256 --out '../../data/stylegan2/traindata'`
update 2 years ago			`python train_condition.py --name 'ffhq_style_s256' --path '../../data/stylegan2/traindata' --config config/conf_server_train_condition_shell.json`
update training code 2 years ago			```

			`after training, generated content calibrated samples via:`
			```bash
			`python style_blend.py --name 'ffhq_style_s256'`
			`python generate_blendmodel.py --name 'ffhq_style_s256' --save_dir '../../data/face_cartoon/syn_style_faces'`
			```

			`- geometry calibration`

			`run geometry calibration for both photo and cartoon:`
			```bash
			`cd source`
			`python image_flip_agument_parallel.py --data_dir '../data/face_cartoon'`
			`python image_scale_agument_parallel_flat.py --data_dir '../data/face_cartoon'`
			`python image_rotation_agument_parallel_flat.py --data_dir '../data/face_cartoon'`
			```

			`- train texture translator`

			`The dataset structure is recommended as:`
			```
			`+—data`
			`\| +—face_photo`
			`\| +—face_cartoon`
			```
update 2 years ago			`resume training from pretrained model in similar style,`
update multi-style models 2 years ago
update training code 2 years ago			`style can be chosen from 'anime, 3d, handdrawn, sketch, artstyle, sd-design, sd-illustration'`

			```bash
			`python train_localtoon.py --data_dir PATH_TO_YOU_DATA --work_dir PATH_SAVE --style anime`
			```
update multi-style models 2 years ago


update 3 years ago
			`## Acknowledgments`

			`Face detector and aligner are adapted from [Peppa_Pig_Face_Engine](https://github.com/610265158/Peppa_Pig_Face_Engine`
			`) and [InsightFace](https://github.com/TreB1eN/InsightFace_Pytorch).`


update 3 years ago
			`## Citation`

			`If you find this code useful for your research, please use the following BibTeX entry.`

			```bibtex
			`@inproceedings{men2022dct,`
			`title={DCT-Net: Domain-Calibrated Translation for Portrait Stylization},`
			`author={Men, Yifang and Yao, Yuan and Cui, Miaomiao and Lian, Zhouhui and Xie, Xuansong},`
			`journal={ACM Transactions on Graphics (TOG)},`
			`volume={41},`
			`number={4},`
			`pages={1--9},`
			`year={2022},`
update 3 years ago			`publisher={ACM New York, NY, USA}`
update 3 years ago			`}`
			```
update 3 years ago