Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens

Ziwei Shan^1,*, Yaoyu He^1,*, Chengfeng Zhao^1,*,†, Jiashen Du^1,
Jingyan Zhang^1, Qixuan Zhang^1,2, Jingyi Yu^1,‡, Lan Xu^1,‡

^1ShanghaiTech University ^2Deemos Technology
^*Equal contribution
^†Project lead ^‡Corresponding author

🚀 Getting Started

1. Environment Setup

We tested our environment on Ubuntu 20.04 LTS and Windows 11 with CUDA 12.1.

conda create python=3.10 --name mojito
conda activate mojito

conda install pytorch==2.5.0 torchvision==0.20.0 torchaudio==2.5.0 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -r requirements.txt

# Skip the DeepSpeed installation on Windows 11
DS_BUILD_OPS=1 DS_BUILD_CUTLASS_OPS=0 DS_BUILD_RAGGED_DEVICE_OPS=0 DS_BUILD_EVOFORMER_ATTN=0 pip install deepspeed

conda install -c fvcore -c iopath -c conda-forge fvcore iopath
pip install "git+https://github.com/facebookresearch/pytorch3d.git@stable"

pip install "fastapi[standard]"
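
After installation, a quick sanity check can confirm that the main packages are discoverable from the active environment. The following is a minimal sketch, not part of the repository: the helper names and the `EXPECTED_MODULES` list are assumptions chosen for illustration, based on the packages installed above.

```python
import importlib.util
import sys

# Top-level import names for the packages installed in the steps above.
# This list is an assumption for illustration; adjust it to your setup.
EXPECTED_MODULES = ["torch", "torchvision", "torchaudio", "pytorch3d", "fastapi"]


def find_missing_modules(names):
    """Return the names from `names` that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_version_matches(major=3, minor=10):
    """Check whether the running interpreter has the given major.minor version."""
    return sys.version_info[:2] == (major, minor)


if __name__ == "__main__":
    missing = find_missing_modules(EXPECTED_MODULES)
    print("Python 3.10:", python_version_matches())
    print("Missing modules:", missing or "none")
```

The check only looks up each module without importing it, so it does not verify that CUDA is usable; it is meant as a first pass before running the example.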

2. Prepare Body Model and Weights

Download SMPL-H (the extended SMPL+H model) and place the model files under the body_model/ folder. The body_model/ folder should have the following structure:

body_model/
|--body_model.py
|--utils.py
|--smplh/
|----info.txt
|----LICENSE.txt
|----female/
|------model.npz
|----male/
|------model.npz
|----neutral/
|------model.npz
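
To confirm that the layout above is in place before running anything, a small check like the one below may help. It is a sketch under the assumption that the folder sits at `body_model/` relative to the working directory; the function name `missing_body_model_files` is hypothetical and not part of the repository.

```python
from pathlib import Path

# Relative paths taken from the folder structure listed above.
EXPECTED_FILES = [
    "body_model.py",
    "utils.py",
    "smplh/info.txt",
    "smplh/LICENSE.txt",
    "smplh/female/model.npz",
    "smplh/male/model.npz",
    "smplh/neutral/model.npz",
]


def missing_body_model_files(root="body_model"):
    """Return the expected files that are absent under `root`."""
    root = Path(root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]
```

Calling `missing_body_model_files()` from the project root returns an empty list when every file is present, and otherwise lists the relative paths that still need to be added.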

3. Download the Pretrained IMU Tokenizer Model

We release the pretrained IMU tokenizer checkpoint, mojito_imu_tokenizer.pth. To set it up:

Download the model checkpoint.
Create a checkpoints/ directory in your project if it doesn't exist.
Place the downloaded file in checkpoints/mojito_imu_tokenizer.pth.
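
The directory step above can also be scripted. The sketch below creates checkpoints/ if needed and reports whether the checkpoint file is already in place; it does not download anything, and the helper name `prepare_checkpoint_dir` is an assumption for illustration.

```python
from pathlib import Path

# File name taken from the instructions above.
CHECKPOINT_NAME = "mojito_imu_tokenizer.pth"


def prepare_checkpoint_dir(project_root="."):
    """Create checkpoints/ under `project_root` if it is missing.

    Returns the expected checkpoint path and whether the file exists there.
    """
    ckpt_dir = Path(project_root) / "checkpoints"
    ckpt_dir.mkdir(parents=True, exist_ok=True)
    ckpt_path = ckpt_dir / CHECKPOINT_NAME
    return ckpt_path, ckpt_path.is_file()
```

For example, `path, present = prepare_checkpoint_dir()` run from the project root gives the location where the downloaded file should go and tells you whether it is already there.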

4. Example

Run the processing script:

python -m example --cfg configs/config_imu_tokenizer.yaml --nodebug
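
If you prefer to launch the example from another Python script, the command above can be wrapped with `subprocess`. This is a minimal sketch: the function names are hypothetical, and only the module name and the `--cfg` and `--nodebug` flags come from the command shown above.

```python
import subprocess
import sys


def build_example_command(cfg="configs/config_imu_tokenizer.yaml"):
    """Build the argument list for the example command shown above."""
    return [sys.executable, "-m", "example", "--cfg", cfg, "--nodebug"]


def run_example(cfg="configs/config_imu_tokenizer.yaml"):
    """Run the example in a subprocess and return its exit code."""
    return subprocess.run(build_example_command(cfg), check=False).returncode
```

Using `sys.executable` keeps the subprocess on the same interpreter as the caller, so it runs inside the activated mojito environment. Call `run_example()` from the project root so that the `example` module and the config path resolve.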

🏄‍♂️ Contributors

Ziwei Shan - koyui
Yaoyu He - TropinoneH
Chengfeng Zhao - AfterJourney00
Jiashen Du - ALT-JS

📖 Citation

If you find our code or paper helpful, please consider citing:

@article{shan2025mojito,
  title   = {Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens},
  author  = {Shan, Ziwei and He, Yaoyu and Du, Jiashen and Zhao, Chengfeng and Zhang, Jingyan and 
             Zhang, Qixuan and Yu, Jingyi and Xu, Lan},
  journal = {arXiv preprint arXiv:},
  year    = {2025}
}

Acknowledgments

We thank the following works, which we build on and benefit from:

MotionGPT: the overall framework;
Qwen2: the causal language model;
EgoEgo: the SMPL-H body model script;
TransPose: the data pre-processing of TotalCapture dataset;
SmoothNet: the SMPL pose smoother.

Licenses

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
