Following [PR #35124](https://github.com/huggingface/transformers/pull/35124), we will add support for vision transformer models that are suitable for on-device deployment