Model Details: DPT-Large
Dense Prediction Transformer (DPT) model trained on 1.4 million images for monocular depth estimation.
It was introduced in the paper Vision Transformers for Dense Prediction by Ranftl et al. (2021) and first released in this repository.
DPT uses the Vision Transformer (ViT) as backbone and adds a neck + head on top for monocular depth estimation.
The model card has been written in combination by the Hugging Face team and Intel.
Model Detail | Description |
---|---|
Model Authors – Company | Intel |
Date | March 22, 2022 |
Version | 1 |
Type | Computer Vision – Monocular Depth Estimation |
Paper or Other Resources | Vision Transformers for Dense Prediction and GitHub Repo |
License | Apache 2.0 |
Questions or Comments | Community Tab and Intel Developers Discord |
前往AI网址导航
正文完