AI
Audio and Multimedia
Big Data
Core Technology for TCS
Extensible Internet
Networking and Security
Research Initiatives
Speech
Usable Security and Privacy
Vision
2i2c
- 2i2c
Robust Deep Learning
- Resilient Dynamic Autoencoders for Modeling and Predicting Earthquake Threats
- Backdoor Detection via Eigenvalues, Hessians, Internal Behaviors, and Robust Statistics

Multimodal Location Estimation

Location estimation is the task of estimating the geo-coordinates of the content recorded in digital media The Berkeley Multimodal Location Estimation project aims to leverage the GPS-tagged media available on the web as training set for an automatic location estimator. The idea is that visual and acoustic cues can narrow down the possible recording location for a given image, video, or audio track. We also investigate the human baseline of location estimation, i.e. how well does a human do in comparison to a computer?

This is a collaboration with the computer vision group at ICSI as well as with the UC Berkeley BASiCS group (Berkeley Audio Visual Signal Processing and Communication Systems).

More information on this project can be found at http://multimedia.icsi.berkeley.edu/multimodal-location-estimation.

Main menu

Multimodal Location Estimation

Quick Links

Research Areas

Projects

Visitor Information

Follow ICSI

Search form

Main menu

Multimodal Location Estimation

Quick Links

Research Areas

Projects

Visitor Information

Follow ICSI