UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation [MM 2024]

This repository contains the implementation of our paper "UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation", accepted at ACM Multimedia 2024.

Overview

UrbanCross aims to improve satellite image-text retrieval by addressing the domain gaps that arise between diverse urban environments. The framework incorporates:

A cross-domain dataset enriched with geo-tags across multiple countries.
A Large Multimodal Model (LMM) for textual refinement and the Segment Anything Model (SAM) for visual augmentation.
Adaptive curriculum-based sampling and weighted adversarial fine-tuning modules.

Because the codebase is large, this repository will be actively maintained and updated. The dataset is still being refined because of its size and will be released on Hugging Face shortly.

Dataset

The UrbanCross dataset is available on Google Drive. It is organized as follows:

.UrbanCross-Dataset
├── Finland
│   ├── image_segments.zip
│   ├── images.zip
│   └── instructblip_generation_finland_refine.csv
├── Germany
│   ├── image_segments.zip
│   ├── images.zip
│   └── instructblip_generation_germany_refine.csv
├── Spain
│   ├── image_segments.zip
│   ├── images.zip
│   └── instructblip_generation_spain_refine.csv

The dataset contains high-resolution satellite images from three countries: Finland, Germany, and Spain. Each image is segmented with the Segment Anything Model (SAM) into ten segments, and the text descriptions were generated with the InstructBLIP model.
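After downloading, it can help to confirm that the local copy matches the tree above before extracting anything. The following is a minimal sketch, not part of the repository: the helper names `expected_files` and `missing_files` and the local folder name are assumptions made for illustration. It only checks that the three files listed for each country are present and does not inspect the archives or the CSV columns.

```python
from pathlib import Path

# Country folders shown in the dataset tree above.
COUNTRIES = ("Finland", "Germany", "Spain")


def expected_files(root, country):
    """Return the three paths the dataset tree lists for one country."""
    folder = Path(root) / country
    return [
        folder / "image_segments.zip",
        folder / "images.zip",
        folder / f"instructblip_generation_{country.lower()}_refine.csv",
    ]


def missing_files(root):
    """List every expected file that is absent under `root` (hypothetical helper)."""
    return [
        path
        for country in COUNTRIES
        for path in expected_files(root, country)
        if not path.is_file()
    ]


if __name__ == "__main__":
    # "UrbanCross-Dataset" is an assumed local download location.
    for path in missing_files("UrbanCross-Dataset"):
        print("missing:", path)
```

An empty result means all nine files are in place; anything printed points at a download that is incomplete or stored under a different folder name.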

Usage

Prerequisites

Python 3.8+
PyTorch 1.10+ with CUDA support
Other dependencies listed in requirements.txt

You can install the required Python packages using:

pip install -r requirements.txt

Alternatively, you can create a Conda environment with:

conda create -n urbancross python=3.8
conda activate urbancross
pip install -r requirements.txt
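Once the packages are installed, the prerequisites listed above can be verified with a short script. This is a minimal sketch under stated assumptions: the function names are hypothetical and not part of the repository, and the check covers only the Python version, the PyTorch version, and CUDA visibility, not the other packages in requirements.txt.

```python
import re
import sys


def version_tuple(version):
    """Parse the leading 'major.minor' of a version string such as '1.10.0+cu113'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version, minimum):
    """Return True if `version` is at least the (major, minor) tuple `minimum`."""
    return version_tuple(version) >= minimum


def check_environment():
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    if sys.version_info[:2] < (3, 8):
        problems.append("Python 3.8+ is required")
    try:
        import torch
    except ImportError:
        problems.append("PyTorch is not installed")
        return problems
    if not meets_minimum(torch.__version__, (1, 10)):
        problems.append(f"PyTorch 1.10+ is required, found {torch.__version__}")
    if not torch.cuda.is_available():
        problems.append("PyTorch cannot see a CUDA device")
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("problem:", problem)
```

Running it inside the activated environment prints one line per unmet prerequisite and nothing when all three checks pass.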

Run

To run the code, use the shell scripts in the cmd directory, which is organized as follows:

.
├── fine-tune
│   ├── finetune_urbancross_curriculum.sh
│   ├── finetune_urbancross.sh
│   └── zeroshot_urbancross.sh
├── test
│   ├── test_urbancross_finland.sh
│   ├── test_urbancross_germany.sh
│   ├── test_urbancross_rsicd.sh
│   ├── test_urbancross_rsitmd.sh
│   ├── test_urbancross_spain.sh
│   ├── test_urbancross_without_sam_finland.sh
│   ├── test_urbancross_without_sam_germany.sh
│   ├── test_urbancross_without_sam_integration.sh
│   ├── test_urbancross_without_sam_rsicd.sh
│   ├── test_urbancross_without_sam_rsitmd.sh
│   └── test_urbancross_without_sam_spain.sh
└── train
    ├── train_urbancross_finland.sh
    ├── train_urbancross_germany.sh
    ├── train_urbancross_rsicd.sh
    ├── train_urbancross_rsitmd.sh
    ├── train_urbancross_spain.sh
    ├── train_urbancross_without_sam_finland.sh
    ├── train_urbancross_without_sam_germany.sh
    ├── train_urbancross_without_sam_integration.sh
    ├── train_urbancross_without_sam_rsicd.sh
    ├── train_urbancross_without_sam_rsitmd.sh
    └── train_urbancross_without_sam_spain.sh
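The train and test script names in the listing above follow a regular pattern: the stage, then urbancross, an optional without_sam marker, and the dataset name. The sketch below builds the path for a given combination. The helper `script_path` is hypothetical and not part of the repository, and it covers only the train and test folders, not fine-tune. Any arguments or paths the scripts themselves expect are not documented here, so inspect a script before running it.

```python
from pathlib import Path

# Dataset names that appear in the train and test listings above.
DATASETS = ("finland", "germany", "rsicd", "rsitmd", "spain")


def script_path(stage, dataset, use_sam=True, cmd_dir="cmd"):
    """Build the path of a train or test script from the naming pattern above."""
    if stage not in ("train", "test"):
        raise ValueError(f"stage must be 'train' or 'test', got {stage!r}")
    # The listing shows an 'integration' script only for the without_sam variant.
    allowed = DATASETS if use_sam else DATASETS + ("integration",)
    if dataset not in allowed:
        raise ValueError(f"no {stage} script listed for dataset {dataset!r}")
    variant = "" if use_sam else "without_sam_"
    return Path(cmd_dir) / stage / f"{stage}_urbancross_{variant}{dataset}.sh"


if __name__ == "__main__":
    print(script_path("train", "finland").as_posix())
    print(script_path("test", "rsicd", use_sam=False).as_posix())
```

Rejecting combinations that are absent from the listing, such as a with-SAM integration script, keeps a typo from turning into a confusing "file not found" error later.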

Citation

If you find our work useful in your research, please consider citing:

@article{zhong2024urbancross,
  title={UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation},
  author={Zhong, Siru and Hao, Xixuan and Yan, Yibo and Zhang, Ying and Song, Yangqiu and Liang, Yuxuan},
  journal={arXiv preprint arXiv:2404.14241},
  year={2024}
}

Contact

For any questions or issues, feel free to open an issue or contact the authors:

Siru Zhong: [email protected]
Yuxuan Liang (Corresponding Author): [email protected]

