
Add risc-v vector extension tests #1003


Merged (19 commits, Jul 11, 2025)

Conversation

nadime15
Contributor

@nadime15 nadime15 commented Jun 4, 2025

This commit adds a submodule containing precompiled tests (around 1800) for the RISC-V vector extension and integrates them into the CMake testing workflow.

Due to their long runtime, these tests are executed weekly via scheduled CI runs or can be triggered manually. Developers can also run these tests locally by enabling the RVV_TESTS flag during the CMake configuration.
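For reference, running them locally would look roughly like this (a sketch only, assuming the RVV_TESTS option above; the build directory and ctest flags are just examples):

# configure with the vector tests enabled, then build and run them via ctest
cmake -S . -B build -DRVV_TESTS=ON
cmake --build build --parallel
ctest --test-dir build --output-on-failure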

These tests were compiled with:
https://github.com/chipsalliance/riscv-vector-tests/tree/c093610

and with the following ISA options:
--isa=rv64gcv_zvl128b_zve64d_zfh_zfhmin_zvfh

Configured with:
VLEN=128 and ELEN=64

This is still a work in progress, but I would appreciate early feedback and would like to discuss a few points. For now, I have hosted the binaries myself, but the idea is to move them to an official RISC-V repository soon.

Discussion points:

  1. Should we rename the binaries to include a .elf suffix, similar to what we do for riscv-tests?
  2. Should we also generate disassembly dumps (*.dump) like in riscv-tests?
  3. I set the default VLEN in config.json to 128 (7) to avoid maintaining multiple JSON files.
  4. Personally, I don't think it makes sense to run these tests on every push, so I added a weekly cron job instead. They can be manually triggered by developers or by us in case someone pushes changes that affect the vector extension.
  5. I excluded the tests for the vector crypto extension because I thought it would save much more time, but in the end I might have only saved 2-3 minutes. I think it might make sense to add them back to the tests?
  6. I reuse existing workflows, which make no use of caches etc. Should I create a whole separate workflow, or are people OK with reusing the existing one?

The whole workflow takes around 40 minutes; here is an example:
https://github.com/nadime15/sail-riscv/actions/runs/15445714226/job/43474657958


github-actions bot commented Jun 4, 2025

Test Results

2 099 tests  +1 697   2 099 ✅ +1 697   17m 49s ⏱️ +14m 55s
    1 suite  ±    0       0 💤 ±    0
    1 file   ±    0       0 ❌ ±    0

Results for commit b8e56c2. ± Comparison against base commit 63661df.

This pull request removes 392 and adds 2089 tests. Note that renamed tests count towards both.
rv32d_rv32mi-p-breakpoint.elf ‑ rv32d_rv32mi-p-breakpoint.elf
rv32d_rv32mi-p-csr.elf ‑ rv32d_rv32mi-p-csr.elf
rv32d_rv32mi-p-illegal.elf ‑ rv32d_rv32mi-p-illegal.elf
rv32d_rv32mi-p-ma_addr.elf ‑ rv32d_rv32mi-p-ma_addr.elf
rv32d_rv32mi-p-ma_fetch.elf ‑ rv32d_rv32mi-p-ma_fetch.elf
rv32d_rv32mi-p-mcsr.elf ‑ rv32d_rv32mi-p-mcsr.elf
rv32d_rv32mi-p-sbreak.elf ‑ rv32d_rv32mi-p-sbreak.elf
rv32d_rv32mi-p-scall.elf ‑ rv32d_rv32mi-p-scall.elf
rv32d_rv32mi-p-shamt.elf ‑ rv32d_rv32mi-p-shamt.elf
rv32d_rv32si-p-csr.elf ‑ rv32d_rv32si-p-csr.elf
…
rv32d_2025-06-22/riscv-tests/rv32mi-p-breakpoint ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-breakpoint
rv32d_2025-06-22/riscv-tests/rv32mi-p-csr ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-csr
rv32d_2025-06-22/riscv-tests/rv32mi-p-illegal ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-illegal
rv32d_2025-06-22/riscv-tests/rv32mi-p-instret_overflow ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-instret_overflow
rv32d_2025-06-22/riscv-tests/rv32mi-p-lh-misaligned ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-lh-misaligned
rv32d_2025-06-22/riscv-tests/rv32mi-p-lw-misaligned ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-lw-misaligned
rv32d_2025-06-22/riscv-tests/rv32mi-p-ma_addr ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-ma_addr
rv32d_2025-06-22/riscv-tests/rv32mi-p-ma_fetch ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-ma_fetch
rv32d_2025-06-22/riscv-tests/rv32mi-p-mcsr ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-mcsr
rv32d_2025-06-22/riscv-tests/rv32mi-p-pmpaddr ‑ rv32d_2025-06-22/riscv-tests/rv32mi-p-pmpaddr
…

♻️ This comment has been updated with latest results.

@arichardson
Collaborator

Since this is already set up as a weekly run, would it be possible to build it as part of the CI flow? Using something like ExternalProject_Add() with a fixed git commit hash?

@nadime15
Contributor Author

nadime15 commented Jun 4, 2025

I think we decided to add the tests as a git submodule instead of using an external project. I purposely didn’t pin it to a specific commit hash so it tracks master. But honestly, I don't mind either way.

@Alasdair
Collaborator

Alasdair commented Jun 4, 2025

I think the idea was we could create a separate repository that would pull these tests in as a submodule, then that repository would have a weekly job that would build the tests how we need them. That job could upload them as a release that we could pull into a job on this repo to test more regularly.

@arichardson
Collaborator

I think the idea was we could create a separate repository that would pull these tests in as a submodule, then that repository would have a weekly job that would build the tests how we need them. That job could upload them as a release that we could pull into a job on this repo to test more regularly.

That's also fine. Since git is not a great way of storing binaries, I'd much prefer having it stored as a github release.

@nadime15
Contributor Author

nadime15 commented Jun 4, 2025

That's also fine. Since git is not a great way of storing binaries, I'd much prefer having it stored as a github release.

Sure sounds good!

I think the idea was we could create a separate repository that would pull these tests in as a submodule, then that repository would have a weekly job that would build the tests how we need them. That job could upload them as a release that we could pull into a job on this repo to test more regularly.

I think I don't understand the submodule part then. What do you mean by "...a separate repository that would pull these tests in as a submodule, then that repository would have a weekly job that would build the tests how we need them."?

The tests are all auto-generated by a script and not manually written, so what exactly do you want to pull in as a submodule and then build?

From what I understand, we create a separate repository that builds different versions of the vector tests using the vector test suite (with varying VLEN, XLEN, etc.) and publishes weekly releases (probably monthly rather than weekly, since the RVV test suite basically never changes).

And finally, we use these releases as part of a CI job in this project.

@pmundkur
Collaborator

pmundkur commented Jun 4, 2025

The separate repo will pull in riscv-tests, riscv-vector-tests and perhaps other test repos as submodules.

@nadime15
Contributor Author

nadime15 commented Jun 4, 2025

Couldn't I just install them via GitHub Actions and then build and release them? What's the advantage of having them as submodules?

@arichardson
Collaborator

Couldn't I just install them via GitHub Actions and then build and release them? What's the advantage of having them as submodules?

I agree there is no need to have them as submodules. In my experience submodules just create lots of pain.

@jordancarlin
Collaborator

The benefit of submodules in the testing repo instead of just cloning with GitHub Actions during the CI jobs is that it pins the version of the tests we are using to a specific commit. We probably don't want the tests updating randomly because that could introduce unexpected failures. We're better off controlling when we update the sources.

Technically we could pin the version the CI job installs, but if we do it with submodules we could use something like Dependabot to automatically open a PR when there is an updated version of any of the repos available.

@pmundkur pmundkur added the tgmm-agenda label (Tagged for the next Golden Model meeting agenda) Jun 6, 2025
@pmundkur
Collaborator

pmundkur commented Jun 6, 2025

Discussion points:

1. Should we rename the binaries to include a .elf suffix, similar to what we do for riscv-tests?

Not sure why those had an .elf suffix; perhaps to make them easier to glob from a script? As long as the files are easily listed in CMake or a script, it shouldn't matter.

2. Should we also generate disassembly dumps (*.dump) like in riscv-tests?

That's not needed.

3. I set the default VLEN in config.json to 128 (7) to avoid maintaining multiple JSON files.

The repository should have a script to generate the appropriate configs, and a script/CMakeLists to run the simulator (or simulators until we have a unified build) with the appropriate configs for the tests.

4. Personally, I don't think it makes sense to run these tests on every push, so I added a weekly cron job instead. They can be manually triggered by developers or by us in case someone pushes changes that affect the vector extension.

Perhaps a daily job is also ok for the vector tests, depending on how long they take. A week might be too long.

5. I excluded the tests for the vector crypto extension because I thought it would save much more time, but in the end I might have only saved 2-3 minutes. I think it might make sense to add them back to the tests?

Agreed, it makes sense to add back.

6. I reuse existing workflows, which make no use of caches etc. Should I create a whole separate workflow, or are people OK with reusing the existing one?

The whole workflow takes around 40 minutes; here is an example: https://github.com/nadime15/sail-riscv/actions/runs/15445714226/job/43474657958

Don't know enough to comment.

@pmundkur
Collaborator

pmundkur commented Jun 6, 2025

@jordancarlin
Collaborator

If the whole workflow only takes 40 minutes, I'd be tempted to have it run on all PRs. The Lean workflow is 30-45+ minutes (depending on the cache), so it wouldn't even extend the total CI time for a PR.

@nadime15 nadime15 force-pushed the add_rvv_test branch 2 times, most recently from 0ec93b2 to 2bc4fe5 on June 8, 2025 at 22:32
@nadime15
Contributor Author

nadime15 commented Jun 8, 2025

I have reworked the flow, made a few changes, and followed @Alasdair's description. (It still needs some work, like clearer step names and general cleanup.) I think it doesn't really make sense to recompile the tests weekly, so instead I followed @jordancarlin's idea to use Dependabot. I am not fully there yet, but I have at least enabled Dependabot. It will now monitor changes in the underlying submodules and open a pull request if one of them updates (so far, riscv-tests and riscv-vector-tests).

A feature I am working on, but have not fully implemented, is to auto-merge Dependabot's PRs if an additional workflow successfully recompiles all tests with the updated commit. If that passes, we automatically merge the PR (which updates the submodule hash) and create a new release based on the latest riscv-tests or riscv-vector-tests commit.

I created a new repository that compiles tests from riscv-tests and riscv-vector-tests for VLEN = [512, 256, 128] and ELEN = [64, 32]. Thanks to parallel execution, this works pretty well (it takes around 3h in total). See the screenshot:

[Screenshot from 2025-06-08 18:35]

I needed to upload all binaries as separate chunks in the release because bundling them into a single tarball exceeds GitHub's 2GB per file limit.
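The chunking itself is nothing fancy; roughly this (file names are placeholders):

# split the bundled tests into chunks below GitHub's 2GB per-file release limit
tar czf vector-tests.tar.gz tests/
split -b 1900M -d vector-tests.tar.gz vector-tests.tar.gz.part
# consumers reassemble and unpack
cat vector-tests.tar.gz.part* > vector-tests.tar.gz
tar xzf vector-tests.tar.gz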

This commit replaces the old workflow I had with a new one that always fetches the latest release and runs (in theory) the full test suite. For now, to save time, I'm only testing riscv-vector-tests with VLEN=512 and ELEN=64.

Here is the link to one workflow, which adds about 2,500 tests.

I was wondering how to handle different VLEN and ELEN values. Should I dynamically create new config files directly in the workflow with the corresponding parameters? We could also simply use the default config file and stick to VLEN = 512 and ELEN = 64 without testing different VLEN/ELEN values.

Edit: I just noticed that the workflow has failed due to:
System.IO.IOException: No space left on device : '/home/runner/runners/2.325.0/_diag/Worker_20250608-221729-utc.log' Unhandled exception. System.IO.IOException: No space left on device : ...

It stopped at around 687/3097 tests.

@jordancarlin
Collaborator

This is definitely moving in the right direction. A few thoughts/comments:

I needed to upload all binaries as separate chunks in the release because bundling them into a single tarball exceeds GitHub's 2GB per file limit.

I think it makes more sense to upload each suite as a separate release artifact. So one tarball for riscv-tests and one for each of the riscv-vector-test variants that you are compiling. Then we can have a matrix workflow in the sail repo where a separate job runs each set of tests to cut down the runtime. It should also help avoid file size issues.

I was wondering how to handle different VLEN and ELEN values. Should I dynamically create new config files directly in the workflow with the corresponding parameters? We could also simply use the default config file and stick to VLEN = 512 and ELEN = 64 without testing different VLEN/ELEN values.

What about using jq to modify VLEN/ELEN in the config file directly in the workflow? Seems like the simplest solution. I think testing multiple VLEN/ELEN combinations has value.
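Something along these lines (the key names and paths here are placeholders; the real config.json layout may differ):

# patch VLEN/ELEN in a copy of the default config for one matrix entry
jq '.vlen = 256 | .elen = 64' config/default.json > config_vlen256_elen64.json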

Edit: I just noticed that the workflow has failed due to:
System.IO.IOException: No space left on device : '/home/runner/runners/2.325.0/_diag/Worker_20250608-221729-utc.log' Unhandled exception. System.IO.IOException: No space left on device : ...

It stopped at around 687/3097 tests.

I have a script that I've been using in other GitHub CI jobs that increases the available space on the runner from ~24GB to ~61GB by deleting lots of unneeded things. I think that should be plenty of space for this. https://github.com/openhwgroup/cvw/blob/main/.github/scripts/cli-space-cleanup.sh
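It boils down to deleting large preinstalled toolchains the job doesn't need, roughly along these lines (the paths are the usual suspects on ubuntu-latest runners, not a copy of the linked script):

# free up runner disk space by removing unneeded preinstalled software
sudo rm -rf /usr/share/dotnet /usr/local/lib/android /opt/ghc
sudo docker image prune --all --force
df -h /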

@nadime15
Contributor Author

nadime15 commented Jun 9, 2025

I think it makes more sense to upload each suite as a separate release artifact. So one tarball for riscv-tests and one for each of the riscv-vector-test variants that you are compiling. Then we can have a matrix workflow in the sail repo where a separate job runs each set of tests to cut down the runtime. It should also help avoid file size issues.

Yeah, this would kinda work too; the only issue is that once you set VLEN ≥ 1024 (which we might do at some point), you hit the same problem again. No idea if that really matters or if people even care about those cases, but I think it is worth mentioning.

I was wondering whether we could have one job that grabs all the parts, reconstructs them into a single tarball, unpacks it, uploads the artifact, and then launches multiple jobs. It's basically what you're describing, just with a slightly different starting point.

What about using jq to modify VLEN/ELEN in the config file directly in the workflow? Seems like the simplest solution. I think testing multiple VLEN/ELEN combinations has value.

Ahh ok, I was not familiar with that tool, makes sense!

I have a script that I've been using in other GitHub CI jobs that increases the available space on the runner from ~24GB to ~61GB by deleting lots of unneeded things. I think that should be plenty of space for this. https://github.com/openhwgroup/cvw/blob/main/.github/scripts/cli-space-cleanup.sh

That is helpful! Not 100% sure, but I think ctest (?) stores logs for each test, and that might be why I am running out of space. The vector test logs are insanely long, so it might make sense to disable logging if that is really what is causing it.

@nadime15
Contributor Author

I have updated a few things as @jordancarlin suggested. Each test set now gets its own tarball, and the new workflow pulls them all in parallel and runs them:

See:
[Screenshot from 2025-06-10 23:06]

and one example run:
[Screenshot from 2025-06-10 23:07]

As discussed on Monday, I also added the option to run the tests locally. By default, it fetches riscv-tests and riscv-vector-tests. It checks VLEN and ELEN dynamically and downloads the matching test set if the values are valid.

One thought: it might be worth adding a CMake option to enable/disable running the vector tests locally (off by default). The default config (VLEN=512, ELEN=64) takes over 3h, while something like VLEN=128, ELEN=32 finishes in ~45min. For GitHub workflows on push, we could tweak VLEN/ELEN on the fly with jq if needed.

Collaborator

@Timmmm Timmmm left a comment


I feel like there must be a better way to handle testing multiple vlen/elen. This way seems very awkward.

I think the best option would be to have each VLEN/ELEN as an option. Something like this:

foreach (vlen IN ITEMS 128 256 512)
  foreach (elen IN ITEMS 32 64)
    option(ENABLE_RISCV_VECTOR_TESTS_V${vlen}_E${elen} "Enable the riscv-vector-tests with vlen=${vlen}, elen=${elen}")
    if (ENABLE_RISCV_VECTOR_TESTS_V${vlen}_E${elen})
      download_riscv_tests(...)
      file(GLOB elfs_rv32d ...)
      file(GLOB elfs_rv64d ...)

      foreach (arch IN ITEMS "rv32d" "rv64d")
        add_custom_command(... call `jq` to generate config JSON ...)

        foreach (elf IN LISTS elfs_${arch})
          file(RELATIVE_PATH elf_name "${CMAKE_CURRENT_SOURCE_DIR}" ${elf})

          add_test(
            NAME "${arch}_${elf_name}"
            COMMAND
              $<TARGET_FILE:riscv_sim_${arch}>
              --config "${CMAKE_CURRENT_BINARY_DIR}/config_${arch}_${vlen}_${elen}.json"
              ${elf}
          )
        endforeach ()
      endforeach ()
    endif ()
  endforeach ()
endforeach ()

Then I think you don't even need to change CI at all; just enable those CMake options.
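Locally (or in CI) that would then just be something like this (the option name is taken from the sketch above; the rest is assumed):

# opt in to one VLEN/ELEN combination, build, and run the tests
cmake -S . -B build -DENABLE_RISCV_VECTOR_TESTS_V128_E64=ON
cmake --build build --parallel
ctest --test-dir build -j "$(nproc)"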

@nadime15
Contributor Author

About your change: I like it. The only thing I am unsure about is using jq, since not everyone has it installed. And even if they do, they might have a version that does not support certain features (mine, jq-1.6, can't do log2).

@Timmmm
Collaborator

Timmmm commented Jun 24, 2025

Yeah, and now that I'm thinking about it, our JSON format supports comments, which jq presumably doesn't handle. We don't have any comments yet, but it would be nice to have them.

Maybe we should instead use configure_file() with a template, like @arichardson was suggesting for #870.

@nadime15
Contributor Author

nadime15 commented Jul 4, 2025

@Timmmm I gave this another try and incorporated all your suggestions. I think the code is cleaner now and easier to understand.

I explicitly disabled tracing for Sail (I assumed the --quiet option for ctest would stop ctest from logging the simulator's output, but it does not). Disabling it had a big impact and saved a lot of time, and probably resources too; the tests now run without any errors.

I have only tried one run so far, but I’ll rerun it to be sure. Here is the workflow.

Updated RISC-V Vector Test Runtimes (New without logging vs Previous)

Job                               New Time     Prev Time
run-riscv-vector-tests (128, 32)  23m 27s      40m 10s
run-riscv-vector-tests (128, 64)  28m 33s      45m 36s
run-riscv-vector-tests (256, 32)  41m 28s      1h 13m 19s
run-riscv-vector-tests (256, 64)  50m 08s      1h 22m 31s
run-riscv-vector-tests (512, 32)  1h 31m 38s   2h 43m 56s
run-riscv-vector-tests (512, 64)  1h 58m 02s   3h 16m 15s
run-riscv-tests                   10m 13s      16m 44s
Total                             6h 3m 29s    10h 18m 31s

Collaborator

@Timmmm Timmmm left a comment


LGTM - fantastic improvement!

Collaborator

@jordancarlin jordancarlin left a comment


One minor suggestion, but LGTM! This will be a great improvement.

@pmundkur pmundkur added the "will be merged" label (Scheduled to be merged in a few days if nobody objects) and removed the tgmm-agenda label Jul 11, 2025
@pmundkur pmundkur added this pull request to the merge queue Jul 11, 2025
Merged via the queue into riscv:master with commit 5487fa7 Jul 11, 2025
7 checks passed
@Timmmm
Collaborator

Timmmm commented Jul 14, 2025

Commit message should have been edited before merging this. :-/
