Definition
The is code test allows you to check if a specified column contains executable code. Currently, Python and
JS code are supported.
Taxonomy
- Category: Integrity.
- Task types: LLM.
- Availability:
and .
Why it matters
- If the LLM is used for code generation or understanding, it is important to ensure that the code is valid and executable.
- For code generation with LLMs, it is particularly important to ensure that the generated code is valid, and not a hallucination.
Test configuration examples
If you are writing a tests.json
, here are a few valid configurations for the character length test:
[
{
"name": "Outputs have valid Python code",
"description": "Asserts that the output column contains valid Python code",
"type": "integrity",
"subtype": "isCode",
"thresholds": [
{
"insightName": "isCode",
"insightParameters": [
{ "name": "column_name", "value": "output" }, // Selects the column `output`
{ "name": "language", "value": "python" } // Checks for valid Python code
],
"measurement": "isCodeRowPercentage",
"operator": ">=",
"value": 1.0
}
],
"subpopulationFilters": null,
"mode": "development",
"usesValidationDataset": true, // Apply test to the validation set
"usesTrainingDataset": false,
"usesMlModel": false,
"syncId": "b4dee7dc-4f15-48ca-a282-63e2c04e0689" // Some unique id
}
]