YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding (CVPR 2026 Highlight)

This repository contains the official PyTorch implementation of SpatialScore: https://arxiv.org/abs/2505.17012/.

Our new version paper has been accepted by CVPR 2026, and we have updated our up-to-date code and data! Feel free to reach out for discussions!

Current Leaderboard (You are welcome to test your models on SpatialScore!):

Some Information

Project Page · Paper · SpatialScore_Benchmark · SpatialCorpus · Model

News

  • [2026.5] We have updated our up-to-date code and data!
  • [2026.4] Glad to share that SpatialScore has been accepted to CVPR 2026 and selected as Highlight.
  • [2025.5] We have released version_0 of our evaluation code, supporting most mainstream models.
  • [2025.5] We have released version_0 of SpatialScore, which is available on Huggingface.
  • [2025.5] Our pre-print paper is released on arXiv.

Requirements

  • Python >= 3.10 (Recommend to use Anaconda or Miniconda)
  • PyTorch >= 2.8.0
  • accelerate == 1.13.0
  • xformers==0.0.32.post1
  • flash-attn==2.8.2
  • vllm == 0.11.0
  • triton == 3.4.0
  • triton_kernels (please refer to gpt_oss for version supporting gpt_oss)
  • transformers == 4.57.3

The aforementioned dependencies are necessary for conducting evaluations on SpatialScore. If you intend to utilize SpatialAgent; since it requires invoking various spatial perception tools, you may need to consult the following repositories to install the corresponding tool dependencies, and download their corresponding pre-trained checkpoints, including Rex-Omni, Map-Anything, GroundingDINO and DetAny3D.

A suitable conda environment named SpatialScore can be created and activated with:

conda env create -f environment.yaml
conda activate SpatialScore

Citation

If you use this code, model, and data for your research or project, please cite:

@inproceedings{wu2026spatialscore,
  author    = {Wu, Haoning and Huang, Xiao and Chen, Yaohui and Zhang, Ya and Wang, Yanfeng and Xie, Weidi},
  title     = {SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year    = {2026},

}

TODO

  • Release Paper
  • Update the final version paper
  • Release version_0 SpatialScore Benchmark
  • Release version_0 Code of Evaluation
  • Release version_0 Base Code of SpatialAgent
  • Release our training resources SpatialCorpus and the SFT models
  • Update SpatialScore Benchmark
  • Update Code of Evaluation
  • Update Code of SpatialAgent

Acknowledgements

Many thanks to the code bases from transformers, Qwen3-VL, and TACO.

Contact

If you have any questions, please feel free to contact haoningwu3639@gmail.com.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for haoningwu/SpatialScore