Commit c149f79

committed
updated docs
1 parent 7bce1a0 commit c149f79

2 files changed: +6 -3 lines changed

docs/confident-ai/confident-ai-introduction.mdx

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@ import Equation from "@site/src/components/equation";
 Are you following best LLM evaluation practices? Without a serious evaluation workflow, your testing results aren't really valid, and you might be wasting a lot of time iterating on the wrong things.
 :::
 
-**Confident AI is the LLM evaluation platform for DeepEval**. It is native to DeepEval, and was designed for teams building LLM applications to maximize its performance, and to safeguard against unsatisfactory LLM outputs. Whilst DeepEval's open-source metrics are great for running evaluations, there is so much more to building a robust LLM evaluation workflow than collecting metric scores.
+**Confident AI is the cloud platform for DeepEval**. It is native to DeepEval, and was designed for teams building LLM applications to maximize its performance, and to safeguard against unsatisfactory LLM outputs. Whilst DeepEval's open-source metrics are great for running evaluations, there is so much more to building a robust LLM evaluation workflow than collecting metric scores.
 
 If you're _serious_ about LLM evaluation, Confident AI is for you.

docs/docs/getting-started.mdx

Lines changed: 5 additions & 2 deletions
@@ -16,7 +16,7 @@ and iterate on LLM (applications) and was built with the following principles in
 - Red team, safety scan LLM applications for security vulnerabilities.
 - Real-time evaluations in production.
 
-Additionally, DeepEval integrates natively with [Confident AI](https://app.confident-ai.com), which allows anyone to **evaluate, regression test, red team, and monitor** LLM applications on the cloud.
+Additionally, DeepEval has a cloud platform [Confident AI](https://app.confident-ai.com), which allow teams to use DeepEval to **evaluate, regression test, red team, and monitor** LLM applications on the cloud.
 
 <Envelope />

@@ -300,7 +300,10 @@ correct_order_node = NonBinaryJudgementNode(
 
 correct_headings_node = BinaryJudgementNode(
     criteria="Does the summary headings contain all three: 'intro', 'body', and 'conclusion'?",
-    children=[VerdictNode(verdict=False, score=0), correct_order_node],
+    children=[
+        VerdictNode(verdict=False, score=0),
+        VerdictNode(verdict=True, child=correct_order_node)
+    ],
 )
 
 extract_headings_node = TaskNode(
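
Note on the last hunk: the change wraps `correct_order_node` in a `True` verdict, so each child of the binary judgement is a verdict branch and the follow-up check only runs when the headings check passes. The routing idea can be sketched roughly as below. This is a toy model, not DeepEval's implementation: `Verdict`, `BinaryJudgement`, and `run` are hypothetical stand-ins, the order check is simplified to a yes/no node, the criteria strings and scores are illustrative, and the judge's answers are passed in as a dict instead of coming from an LLM.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Verdict:
    """Hypothetical verdict branch: ends with a score or hands off to a child node."""
    verdict: bool
    score: Optional[int] = None
    child: Optional["BinaryJudgement"] = None


@dataclass
class BinaryJudgement:
    """Hypothetical yes/no judgement holding one Verdict branch per answer."""
    criteria: str
    children: list


def run(node, answers):
    """Walk the tree using pre-supplied answers (criteria -> bool) in place of a judge."""
    answer = answers[node.criteria]
    for branch in node.children:
        if branch.verdict == answer:
            if branch.child is not None:
                # The matching branch delegates to a follow-up judgement.
                return run(branch.child, answers)
            return branch.score
    raise ValueError(f"no branch for verdict {answer!r}")


# Illustrative tree mirroring the shape of the updated example (scores are assumptions).
order_node = BinaryJudgement(
    criteria="order",
    children=[Verdict(verdict=False, score=4), Verdict(verdict=True, score=10)],
)
headings_node = BinaryJudgement(
    criteria="headings",
    children=[
        Verdict(verdict=False, score=0),
        Verdict(verdict=True, child=order_node),
    ],
)
```

With this shape, a failed headings check returns its score immediately and never consults the order check, while a passed check delegates to it, for example `run(headings_node, {"headings": True, "order": True})`.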
