Actions: confident-ai/deepeval

Lint

681 workflow run results

Update benchmarks-MMLU.mdx with MMLUTask
Lint #2019: Pull request #1313 opened by AMindToThink
January 25, 2025 03:10 23s AMindToThink:patch-2
auto-eval
Lint #2017: Pull request #1283 synchronize by penguine-ip
January 24, 2025 08:51 24s kritinv:auto-eval
auto-eval
Lint #2016: Pull request #1283 synchronize by kritinv
January 24, 2025 07:48 26s kritinv:auto-eval
updated task-completion doc
Lint #2014: Pull request #1310 opened by kritinv
January 24, 2025 05:13 22s kritinv:task-completion-docs
make red teamer surface errors
Lint #2013: Pull request #1309 synchronize by ji21
January 24, 2025 02:28 23s feature/red-teaming-error-surface
make red teamer surface errors
Lint #2012: Pull request #1309 synchronize by ji21
January 24, 2025 02:23 22s feature/red-teaming-error-surface
make red teamer surface errors
Lint #2011: Pull request #1309 synchronize by ji21
January 24, 2025 01:14 23s feature/red-teaming-error-surface
Typo fix. "Confidnet" -> "Confident"
Lint #2009: Pull request #1307 opened by r-sniper
January 23, 2025 10:00 24s r-sniper:main
local model/azure fix
Lint #2008: Pull request #1304 opened by kritinv
January 23, 2025 01:52 24s kritinv:local-model-fix
update docs g-eval
Lint #2007: Pull request #1303 opened by kritinv
January 22, 2025 22:50 22s kritinv:g-eval-deepeval-login
task completion metric
Lint #2006: Pull request #1295 synchronize by kritinv
January 22, 2025 22:45 21s kritinv:task-completion
new release
Lint #2005: Pull request #1302 opened by penguine-ip
January 22, 2025 22:20 23s release-v2.2.2
add conditional displaying
Lint #2004: Pull request #1301 synchronize by ji21
January 22, 2025 22:17 21s features/conditional-display
add conditional displaying
Lint #2003: Pull request #1301 opened by penguine-ip
January 22, 2025 22:15 26s features/conditional-display
Exclude tests folder during installation
Lint #2002: Pull request #1300 opened by fj11
January 22, 2025 20:11 24s fj11:exclude-tests-folder
add validated openai models
Lint #2001: Pull request #1299 opened by luarss
January 22, 2025 16:30 27s luarss:topic/openai-model-fix2
feature status telemetry
Lint #2000: Pull request #1296 opened by kritinv
January 22, 2025 04:11 27s kritinv:telemetry-new-features
task completion metric
Lint #1999: Pull request #1295 opened by kritinv
January 22, 2025 01:17 23s kritinv:task-completion
tool correctness + input parameters and output
Lint #1996: Pull request #1293 synchronize by kritinv
January 21, 2025 08:57 22s kritinv:tool-correctness
tool correctness + input parameters and output
Lint #1995: Pull request #1293 synchronize by kritinv
January 21, 2025 08:39 47s kritinv:tool-correctness