|
1 | 1 | # SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling
|
2 |
| -[](https://put-here-your-paper.com) |
| 2 | +[](https://arxiv.org/abs/2506.15498) |
3 | 3 | [](https://huggingface.co/UKPLab/Llama-3-8b-spare-prm-math)
|
4 | 4 | [](https://www.apache.org/licenses/LICENSE-2.0)
|
5 | 5 | [](https://www.python.org/)
|
|
12 | 12 |
|
13 | 13 | ## Description:
|
14 | 14 |
|
15 |
| -This repository includes the training, inference and evaluation code used in our Arxiv 2025 paper - [SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling](). |
| 15 | +This repository includes the training, inference and evaluation code used in our paper - [SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling](https://arxiv.org/abs/2506.15498). |
16 | 16 |
|
17 | 17 | We introduced a principled framework for a single-pass alignment and step-annotation for automatic process supervision. Process Reward Models (SPARE-PRMs) trained based on the proposed annotation scheme outperform baselines such as Self-Consistency and ORM-weighted aggregation on four datasets across mathematical, question-answering and spatial reasoning datasets. The annotation scheme is also competitive while being computationally efficient compared to tree-search based annotation methods.
|
18 | 18 |
|
@@ -128,14 +128,14 @@ If you want to set up this template:
|
128 | 128 | If you use this repository, our trained SPARE-PRM model or our work, please cite:
|
129 | 129 |
|
130 | 130 | ```
|
131 |
| -@misc{rizvi2024spare, |
| 131 | +@misc{rizvi2025sparesinglepassannotationreferenceguided, |
132 | 132 | title={SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling},
|
133 | 133 | author={Md Imbesat Hassan Rizvi and Xiaodan Zhu and Iryna Gurevych},
|
134 | 134 | year={2025},
|
135 |
| - eprint={}, |
| 135 | + eprint={2506.15498}, |
136 | 136 | archivePrefix={arXiv},
|
137 | 137 | primaryClass={cs.CL},
|
138 |
| - url={https://arxiv.org/abs/}, |
| 138 | + url={https://arxiv.org/abs/2506.15498}, |
139 | 139 | }
|
140 | 140 | ```
|
141 | 141 |
|
|
0 commit comments