
Monocular Object Distance Estimation

Using a single camera to estimate the distances of objects reduces costs compared to stereo vision and LiDAR. Although monocular distance estimation has been studied in the literature, previous methods all rely on knowing an object's class in some way.
In this paper, we aim to circumvent the potential downsides of such approaches and provide an alternative technique that does not use any information about an object's class.
We propose a method that combines the change in an object's appearance over time with the camera's motion to estimate the object's distance from the camera.
Furthermore, we have designed our model to be adaptable to new environments: it maintains performance across different object detectors and can easily be adapted to new object classes.
We test our model across different training and testing scenarios on the KITTI MOTS dataset's ground-truth annotations and on TrackRCNN and EagerMOT outputs.
These detections are then used to obtain measurements that are agnostic to both the detection source and the object class.
Our results show that we outperform IPM- and SVR-based methods in test environments with multi-class detections.

Q: Why is it not desirable to memorize distances according to object type? Isn’t this how we often estimate depth, by knowing approx. size of objects, and how big they look?
A: This method is desirable, and our term-project results (and other research) show it is highly effective. The caveat, however, is that it is highly specific to the object classification scheme of the dataset. For example, KITTI has 7 object classes while nuScenes has 23. A model built this way can only operate in environments that are very similar to its training data, meaning that every object whose distance it estimates must belong to one of the object classes in the training data. If a user wishes to expand the model's functionality to predict the distances of a new type of object, the model will need to be:

  1. remade with an extra input parameter
  2. retrained with a new dataset that has that additional class of object

In short, this approach is very specific and has zero flexibility in what it can be used for; the sketch below illustrates why.
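
To make the inflexibility concrete, here is a minimal, hypothetical sketch (not our actual architecture; the layer sizes and names are made up for illustration): a class-conditioned regressor bakes the number of classes into its input layer, so adding one class changes the network's shape and forces a rebuild and retrain, while a class-agnostic regressor's interface is unchanged.

```python
import torch
import torch.nn as nn

NUM_CLASSES = 7  # e.g., KITTI; moving to nuScenes (23) forces a rebuild


class ClassConditionedEstimator(nn.Module):
    """Distance regressor conditioned on a one-hot class vector.

    Adding a new object class changes the input width, so the network
    must be redefined and retrained from scratch.
    """
    def __init__(self, feat_dim: int = 16, num_classes: int = NUM_CLASSES):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feat_dim + num_classes, 64),
            nn.ReLU(),
            nn.Linear(64, 1),  # predicted distance
        )

    def forward(self, feats: torch.Tensor, class_onehot: torch.Tensor):
        return self.net(torch.cat([feats, class_onehot], dim=-1))


class ClassAgnosticEstimator(nn.Module):
    """Distance regressor using only class-independent measurements
    (e.g., appearance change and camera motion); its interface does
    not change when new object types appear.
    """
    def __init__(self, feat_dim: int = 16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feat_dim, 64),
            nn.ReLU(),
            nn.Linear(64, 1),
        )

    def forward(self, feats: torch.Tensor):
        return self.net(feats)
```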

Q: Is it possible to train using TrackRCNN detections (but still with GT depth values)?
A: Yes! The results are identical (within the margin of error) to those of the model trained on GT detections. This indicates that the approach is portable to different datasets, and that if we can improve its fundamental accuracy, it can be used on all systems.

Q: how much does using IMU data help?
A: The current approach is based on the mathematical formulation that was part of the presentation. If we know:

  1. how much the camera has moved
  2. how much the object has changed

we can predict how far away the object is. A simplified worked example follows.
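
As an illustration only (this assumes an ideal pinhole camera, a static object, and pure forward motion, which is a simplification on our part, not necessarily the paper's exact formulation): if the camera moves a distance Δz toward an object and the object's apparent size scales by r = s₂/s₁, then the initial distance is Z₁ = r·Δz / (r − 1).

```python
def distance_from_motion_and_scale(delta_z: float, s1: float, s2: float) -> float:
    """Estimate an object's initial distance from camera motion and
    apparent-size change (ideal pinhole model, static object, pure
    forward motion -- a hypothetical simplification).

    delta_z: forward distance the camera moved (e.g., from IMU/odometry)
    s1, s2:  apparent size of the object (pixels) before and after the move
    """
    r = s2 / s1  # scale change; r > 1 when approaching the object
    if abs(r - 1.0) < 1e-9:
        raise ValueError("no apparent size change; distance is unobservable")
    return r * delta_z / (r - 1.0)


# Example: a camera moves 2 m toward an object whose apparent size
# grows from 80 to 100 pixels (r = 1.25) -> initial distance 10 m.
print(distance_from_motion_and_scale(delta_z=2.0, s1=80.0, s2=100.0))  # 10.0
```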

Q: Main contributions I am trying to claim:
A: A monocular object distance estimator with the following properties:

  1. Class agnosticity: instead of just memorizing how big certain objects appear at certain distances (and then inter-/extrapolating), estimate distances with no object-specific information
  2. Dataset agnosticity: maintain performance across GT annotations and the outputs of third-party object tracking networks

Besides that, with the current method I would also like to claim the preprocessing technique that represents the changes over 3 time intervals with 3 channels, but we'll see if this works out. A rough sketch of one plausible reading is below.
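
Since the exact preprocessing is not spelled out here, this sketch is only a guess at one plausible implementation (every function and parameter name is hypothetical): take crops of the tracked object at four timestamps, compute the frame-to-frame differences over the three intervals, and stack them as the three channels of one input image.

```python
import numpy as np


def encode_interval_changes(crops: list) -> np.ndarray:
    """Pack the appearance change over 3 consecutive intervals into
    3 channels (hypothetical sketch; the actual preprocessing in the
    paper may differ).

    crops: 4 grayscale crops of the same tracked object at times
           t-3, t-2, t-1, t, each resized to the same (H, W) shape.
    Returns an (H, W, 3) array: channel k holds the absolute pixel
    difference over interval k.
    """
    assert len(crops) == 4, "need 4 crops to form 3 intervals"
    diffs = [
        np.abs(crops[i + 1].astype(np.float32) - crops[i].astype(np.float32))
        for i in range(3)
    ]
    return np.stack(diffs, axis=-1)


# Usage with dummy data: four 64x64 crops -> one 64x64x3 change image.
crops = [np.random.rand(64, 64).astype(np.float32) for _ in range(4)]
change_img = encode_interval_changes(crops)
print(change_img.shape)  # (64, 64, 3)
```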

