Intel/dpt-large

129次阅读

Intel/dpt-large

Model Detail	Description
Model Authors – Company	Intel
Date	March 22, 2022
Version	1
Type	Computer Vision – Monocular Depth Estimation
Paper or Other Resources	Vision Transformers for Dense Prediction and GitHub Repo
License	Apache 2.0
Questions or Comments	Community Tab and Intel Developers Discord

Model Details: DPT-Large

Dense Prediction Transformer (DPT) model trained on 1.4 million images for monocular depth estimation.
It was introduced in the paper Vision Transformers for Dense Prediction by Ranftl et al. (2021) and first released in this repository.
DPT uses the Vision Transformer (ViT) as backbone and adds a neck + head on top for monocular depth estimation.

The model card has been written in combination by the Hugging Face team and Intel.

Model Detail Description

Model Authors – Company Intel

Date March 22, 2022

Version 1

Type Computer Vision – Monocular Depth Estimation

Paper or Other Resources Vision Transformers for Dense Prediction and GitHub Repo

License Apache 2.0

Questions or Comments Community Tab and Intel Developers Discord