Skip to content

Commit 0459d8d

Browse files
committed
Updated README with Arxiv details
1 parent f538147 commit 0459d8d

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling
2-
[![Arxiv](https://img.shields.io/badge/Arxiv-YYMM.NNNNN-red?style=flat-square&logo=arxiv&logoColor=white)](https://put-here-your-paper.com)
2+
[![Arxiv](https://img.shields.io/badge/Arxiv-YYMM.NNNNN-red?style=flat-square&logo=arxiv&logoColor=white)](https://arxiv.org/abs/2506.15498)
33
[![Hugging Face Model](https://img.shields.io/badge/HuggingFace-Model-yellow)](https://huggingface.co/UKPLab/Llama-3-8b-spare-prm-math)
44
[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://www.apache.org/licenses/LICENSE-2.0)
55
[![Python Versions](https://img.shields.io/badge/Python-3.10-blue.svg?style=flat&logo=python&logoColor=white)](https://www.python.org/)
@@ -12,7 +12,7 @@
1212

1313
## Description:
1414

15-
This repository includes the training, inference and evaluation code used in our Arxiv 2025 paper - [SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling]().
15+
This repository includes the training, inference and evaluation code used in our paper - [SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling](https://arxiv.org/abs/2506.15498).
1616

1717
We introduced a principled framework for a single-pass alignment and step-annotation for automatic process supervision. Process Reward Models (SPARE-PRMs) trained based on the proposed annotation scheme outperform baselines such as Self-Consistency and ORM-weighted aggregation on four datasets across mathematical, question-answering and spatial reasoning datasets. The annotation scheme is also competitive while being computationally efficient compared to tree-search based annotation methods.
1818

@@ -128,14 +128,14 @@ If you want to set up this template:
128128
If you use this repository, our trained SPARE-PRM model or our work, please cite:
129129

130130
```
131-
@misc{rizvi2024spare,
131+
@misc{rizvi2025sparesinglepassannotationreferenceguided,
132132
title={SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling},
133133
author={Md Imbesat Hassan Rizvi and Xiaodan Zhu and Iryna Gurevych},
134134
year={2025},
135-
eprint={},
135+
eprint={2506.15498},
136136
archivePrefix={arXiv},
137137
primaryClass={cs.CL},
138-
url={https://arxiv.org/abs/},
138+
url={https://arxiv.org/abs/2506.15498},
139139
}
140140
```
141141

0 commit comments

Comments
 (0)