Skip to content

Commit 897fc58

Browse files
committed
update README and fix bugs
1. Updated import paths for MoGeModel to reflect new structure. 2. Added instructions for manually downloading and setting up submodules in README. 3. Included VGGT submodule path in pipelines.py for improved functionality.
1 parent 5dff3dc commit 897fc58

File tree

5 files changed

+15
-3
lines changed

5 files changed

+15
-3
lines changed

README.md

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -33,9 +33,17 @@ pip install git+https://github.com/asomoza/image_gen_aux.git
3333
3434
3. Make sure the submodule and requirements are installed:
3535
```
36+
mkdir -p submodules
3637
git submodule update --init --recursive
3738
pip install -r requirements.txt
3839
```
40+
If the submodules are not installed, you need to manually download them and move them to `submodules/`. Run the following commands to install the submodules:
41+
```
42+
# MoGe
43+
git clone https://github.com/microsoft/MoGe.git submodules/MoGe
44+
# VGGT
45+
git clone https://github.com/facebookresearch/vggt.git submodules/vggt
46+
```
3947
4048
4. Manually download these checkpoints:
4149
- SpatialTracker checkpoint: [Google Drive](https://drive.google.com/drive/folders/1UtzUJLPhJdUg2XvemXXz1oe6KUQKVjsZ) and move it to `checkpoints/`.
@@ -116,7 +124,7 @@ python demo.py \
116124
--output_dir <output_dir> \ # output directory
117125
--input_path <input_path> \ # the reference image or video path
118126
--camera_motion <camera_motion> \ # the camera motion type, see examples below
119-
--tracking_method <tracking_method> \ # the tracking method (moge, spatracker). For image input, 'moge' is necessary.
127+
--tracking_method <tracking_method> \ # the tracking method (moge, spatracker, cotracker). For image input, 'moge' is necessary.
120128
--gpu <gpu_id> \ # the gpu id
121129
```
122130

@@ -345,6 +353,7 @@ This project builds upon several excellent open source projects:
345353

346354
* [MoGe](https://github.com/microsoft/MoGe) - Microsoft's monocular geometry estimation model that helps achieve more accurate 3D reconstruction.
347355

356+
* [vggt](https://github.com/facebookresearch/vggt) - Facebook's video generation model that provides the foundational architecture for this project.
348357
We thank the authors and contributors of these projects for their valuable contributions to the open source community!
349358

350359
## Citation

demo.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@
1818
from diffusers.utils import load_image, load_video
1919

2020
from models.pipelines import DiffusionAsShaderPipeline, FirstFrameRepainter, CameraMotionGenerator, ObjectMotionGenerator
21-
from submodules.MoGe.moge.model import MoGeModel
21+
from submodules.MoGe.moge.model.v1 import MoGeModel
2222
from submodules.vggt.vggt.utils.pose_enc import pose_encoding_to_extri_intri
2323
from submodules.vggt.vggt.models.vggt import VGGT
2424

models/pipelines.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,7 @@
66
project_root = os.path.dirname(os.path.dirname(os.path.abspath(__file__)))
77
try:
88
sys.path.append(os.path.join(project_root, "submodules/MoGe"))
9+
sys.path.append(os.path.join(project_root, "submodules/vggt"))
910
os.environ["TOKENIZERS_PARALLELISM"] = "false"
1011
except:
1112
print("Warning: MoGe not found, motion transfer will not be applied")
@@ -21,7 +22,7 @@
2122
from models.spatracker.utils.visualizer import Visualizer
2223
from models.cogvideox_tracking import CogVideoXImageToVideoPipelineTracking
2324

24-
from submodules.MoGe.moge.model import MoGeModel
25+
from submodules.MoGe.moge.model.v1 import MoGeModel
2526

2627
from image_gen_aux import DepthPreprocessor
2728
from moviepy.editor import ImageSequenceClip

submodules/MoGe

Submodule MoGe added at a8c3734

submodules/vggt

Submodule vggt added at b02cc03

0 commit comments

Comments
 (0)