Update README.md
README.md
CHANGED
|
@@ -1,7 +1,6 @@
|
|
| 1 |
---
|
| 2 |
license: cc-by-nc-sa-4.0
|
| 3 |
---
|
| 4 |
-
|
| 5 |
# CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
|
| 6 |
|
| 7 |
<div style="display: flex; justify-content: center; align-items: center;">
|
|
@@ -29,12 +28,14 @@ license: cc-by-nc-sa-4.0
|
|
| 29 |
</div>
|
| 30 |
|
| 31 |
|
| 32 |
-
|
| 33 |
**CatVTON** is a simple and efficient virtual try-on diffusion model with ***1) Lightweight Network (899.06M parameters in total)***, ***2) Parameter-Efficient Training (49.57M trainable parameters)*** and ***3) Simplified Inference (< 8G VRAM for 1024×768 resolution)***.
|
| 34 |
|
| 35 |
|
| 36 |
|
| 37 |
## Updates
|
| 38 |
- **`2024/08/10`**: Our 🤗 [**HuggingFace Space**](https://huggingface.co/spaces/zhengchong/CatVTON) is available now! Thanks for the grant from [**ZeroGPU**](https://huggingface.co/zero-gpu-explorers)!
|
| 39 |
- **`2024/08/09`**: [**Evaluation code**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#3-calculate-metrics) is provided to calculate metrics.
|
| 40 |
- **`2024/07/27`**: We provide code and workflow for deploying CatVTON on [**ComfyUI**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#comfyui-workflow) 🔥.
|
|
@@ -47,7 +48,14 @@ license: cc-by-nc-sa-4.0
|
|
| 47 |
|
| 48 |
|
| 49 |
## Installation
|
| 50 |
-
|
| 51 |
|
| 52 |
## Deployment
|
| 53 |
### ComfyUI Workflow
|
|
@@ -63,7 +71,10 @@ To deploy CatVTON to your ComfyUI, follow these steps:
|
|
| 63 |
>
|
| 64 |
When you run the CatVTON workflow for the first time, the weight files will be downloaded automatically, which usually takes tens of minutes.
|
| 65 |
|
| 66 |
-
|
| 67 |
<!-- <div align="center">
|
| 68 |
<img src="resource/img/comfyui.png" width="100%" height="100%"/>
|
| 69 |
</div> -->
|
|
@@ -96,7 +107,7 @@ Once the datasets are downloaded, the folder structures should look like these:
|
|
| 96 |
│   │   │   ├── [000006_00_mask.png | 000008_00.png | ...]
|
| 97 |
...
|
| 98 |
```
|
| 99 |
-
|
| 100 |
```
|
| 101 |
├── DressCode
|
| 102 |
|   ├── test_pairs_paired.txt
|
|
@@ -110,6 +121,11 @@ For the DressCode dataset, we provide [our preprocessed agnostic masks](https://
|
|
| 110 |
│   │   │   ├── [013563_0.png | 013564_0.png | ...]
|
| 111 |
...
|
| 112 |
```
|
| 113 |
|
| 114 |
### 2. Inference on VITON-HD/DressCode
|
| 115 |
To run inference on the DressCode or VITON-HD dataset, run the following command; checkpoints will be downloaded automatically from HuggingFace.
|
|
|
|
| 1 |
---
|
| 2 |
license: cc-by-nc-sa-4.0
|
| 3 |
---
|
|
|
|
| 4 |
# CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
|
| 5 |
|
| 6 |
<div style="display: flex; justify-content: center; align-items: center;">
|
|
|
|
| 28 |
</div>
|
| 29 |
|
| 30 |
|
|
|
|
| 31 |
**CatVTON** is a simple and efficient virtual try-on diffusion model with ***1) Lightweight Network (899.06M parameters in total)***, ***2) Parameter-Efficient Training (49.57M trainable parameters)*** and ***3) Simplified Inference (< 8G VRAM for 1024×768 resolution)***.
|
| 32 |
|
| 33 |
|
| 34 |
|
| 35 |
## Updates
|
| 36 |
+
- **`2024/10/17`**: [**Mask-free version**](https://huggingface.co/zhengchong/CatVTON-MaskFree) 🤗 of CatVTON is released; please try it in our [**Online Demo**](http://120.76.142.206:8888).
|
| 37 |
+
- **`2024/10/13`**: We have built a repo [**Awesome-Try-On-Models**](https://github.com/Zheng-Chong/Awesome-Try-On-Models) that focuses on image, video, and 3D-based try-on models published after 2023, aiming to provide insights into the latest technological trends. If you're interested, feel free to contribute or give it a star!
|
| 38 |
+
- **`2024/08/13`**: We localize DensePose & SCHP to avoid certain environment issues.
|
| 39 |
- **`2024/08/10`**: Our 🤗 [**HuggingFace Space**](https://huggingface.co/spaces/zhengchong/CatVTON) is available now! Thanks for the grant from [**ZeroGPU**](https://huggingface.co/zero-gpu-explorers)!
|
| 40 |
- **`2024/08/09`**: [**Evaluation code**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#3-calculate-metrics) is provided to calculate metrics.
|
| 41 |
- **`2024/07/27`**: We provide code and workflow for deploying CatVTON on [**ComfyUI**](https://github.com/Zheng-Chong/CatVTON?tab=readme-ov-file#comfyui-workflow) 🔥.
|
|
|
|
| 48 |
|
| 49 |
|
| 50 |
## Installation
|
| 51 |
+
|
| 52 |
+
Create a conda environment and install the requirements:
|
| 53 |
+
```shell
|
| 54 |
+
conda create -n catvton python==3.9.0
|
| 55 |
+
conda activate catvton
|
| 56 |
+
cd CatVTON-main # or your path to CatVTON project dir
|
| 57 |
+
pip install -r requirements.txt
|
| 58 |
+
```
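
After installation, it can help to confirm that the active interpreter matches the Python 3.9 release pinned in the commands above. The sketch below is not part of the CatVTON repository: `check_python_version` is a hypothetical helper name, and it only inspects a version string passed to it.

```shell
# Hypothetical helper (not shipped with CatVTON): accept only 3.9.x version strings.
check_python_version() {
  case "$1" in
    3.9|3.9.*) echo "ok: Python $1" ;;
    *) echo "unexpected Python version: $1" >&2; return 1 ;;
  esac
}

# Example call, to be run inside the activated catvton environment:
# check_python_version "$(python -c 'import platform; print(platform.python_version())')"
```

A non-zero exit status suggests the wrong environment is active, in which case rerunning `conda activate catvton` is the first thing to try.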
|
| 59 |
|
| 60 |
## Deployment
|
| 61 |
### ComfyUI Workflow
|
|
|
|
| 71 |
>
|
| 72 |
When you run the CatVTON workflow for the first time, the weight files will be downloaded automatically, which usually takes tens of minutes.
|
| 73 |
|
| 74 |
+
<div align="center">
|
| 75 |
+
<img src="resource/img/comfyui-1.png" width="100%" height="100%"/>
|
| 76 |
+
</div>
|
| 77 |
+
|
| 78 |
<!-- <div align="center">
|
| 79 |
<img src="resource/img/comfyui.png" width="100%" height="100%"/>
|
| 80 |
</div> -->
|
|
|
|
| 107 |
│   │   │   ├── [000006_00_mask.png | 000008_00.png | ...]
|
| 108 |
...
|
| 109 |
```
|
| 110 |
+
|
| 111 |
```
|
| 112 |
├── DressCode
|
| 113 |
|   ├── test_pairs_paired.txt
|
|
|
|
| 121 |
│   │   │   ├── [013563_0.png | 013564_0.png | ...]
|
| 122 |
...
|
| 123 |
```
|
| 124 |
+
For the DressCode dataset, we provide a script to preprocess agnostic masks. Run the following command:
|
| 125 |
+
```shell
|
| 126 |
+
CUDA_VISIBLE_DEVICES=0 python preprocess_agnostic_mask.py \
|
| 127 |
+
--data_root_path <your_path_to_DressCode>
|
| 128 |
+
```
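
Before launching the preprocessing script, a quick check of the dataset root can catch a mistyped path early. The following is a minimal sketch, assuming the DressCode folder layout shown above (with `test_pairs_paired.txt` directly under the root); `check_dresscode_root` is a hypothetical helper and is not provided by the repository.

```shell
# Hypothetical helper (not part of CatVTON): fail fast if the DressCode root looks wrong.
check_dresscode_root() {
  root="$1"
  # The root must be an existing directory.
  if [ ! -d "$root" ]; then
    echo "missing directory: $root" >&2
    return 1
  fi
  # The folder structure above lists test_pairs_paired.txt directly under DressCode.
  if [ ! -f "$root/test_pairs_paired.txt" ]; then
    echo "missing test_pairs_paired.txt in $root" >&2
    return 1
  fi
  echo "ok: $root"
}

# Example call, chaining the check with the command from the README:
# check_dresscode_root "$DATA_ROOT" && \
#   CUDA_VISIBLE_DEVICES=0 python preprocess_agnostic_mask.py --data_root_path "$DATA_ROOT"
```

Here `DATA_ROOT` is a placeholder variable for your own path to DressCode; the check only looks at one file, so it is a sanity test rather than a full validation of the dataset.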
|
| 129 |
|
| 130 |
### 2. Inference on VITON-HD/DressCode
|
| 131 |
To run inference on the DressCode or VITON-HD dataset, run the following command; checkpoints will be downloaded automatically from HuggingFace.
|