StreetLens: Enabling Human-Centered AI Agents for
Neighborhood Assessment from Street View Imagery

Department of Computer Science and Engineering, University of Minnesota

Family Social Science, College of Education and Human Development, University of Minnesota

GeoHCC’25 Paper arXiv Paper Code

Introduction

StreetLens is a human-centered, researcher-configurable workflow, that embeds relevant social science expertise in a vision language model (VLM) for scalable neighborhood environmental assessments. StreetLens mimics the process of trained human coders by grounding the analysis in questions derived from established interview protocols, retrieving relevant street view imagery (SVI), and generating a wide spectrum of semantic annotations from objective features (e.g., the number of cars) to subjective perceptions (e.g., the sense of disorder in an image).


Jupyter Notebooks

1_data_exploration.ipynb 2_assess_neighborhood_environment.ipynb

BibTeX

@misc{kim2025streetlensenablinghumancenteredai, title={StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery}, author={Jina Kim and Leeje Jang and Yao-Yi Chiang and Guanyu Wang and Michelle Pasco}, year={2025}, eprint={2506.14670}, archivePrefix={arXiv}, primaryClass={cs.HC}, url={https://arxiv.org/abs/2506.14670}, }