Update README.md
README.md
CHANGED
@@ -128,7 +128,7 @@ F1 score was used to measure performance, prioritizing detection of noncompliance
 
 ### Results
 
-NAVI-small-preview achieved an F1 score of 86.8% on the public subset of the PAV dataset, outperforming all tested alternatives except full-scale NAVI. We evaluate against general-purpose solutions such as Claude and OpenAI models, as well as guardrails focused on groundedness, to demonstrate a clear distinction between policy verification and the more common groundedness.
+NAVI-small-preview achieved an F1 score of 86.8% on the public subset of the PAV dataset, outperforming all tested alternatives except full-scale NAVI. We evaluate against general-purpose solutions such as Claude and OpenAI models, as well as guardrails focused on groundedness, to demonstrate a clear distinction between policy verification and the more common groundedness verification.
 
 | Model | F1 Score | Avg Latency (ms) |
 |--------------------------|----------|------------------|