Home Schema Example JSON Schema

Test Suite

Type: object

Test suite containing test cases and associated metadata.

Endpoint


The Jarvis endpoint for running the test. Defaults to the runner's environment.

Jarvis V1 Endpoint

Type: object

Describes the endpoint for Jarvis V1.

Endpoint URL

Default: null

The HTTPS endpoint for Jarvis.

Type: stringFormat: uri

Must be at least 1 characters long

Must be at most 2083 characters long

HTTP Timeout

Type: number Default: 10

The timeout in seconds for HTTP requests.

Max Concurrency

Type: integer Default: 10

The maximum number of concurrent queries to run. Defaults to 10.

Type

Type: const Default: "v1"
Specific value: "v1"

Jarvis V2 Endpoint

Type: object

Describes the endpoint for Jarvis V2.

Async Timeout

Type: number Default: 60

The timeout in seconds for async queries. Only applicable when use_async is True.

Endpoint URL

Default: null

The HTTPS endpoint for Jarvis.

Type: stringFormat: uri

Must be at least 1 characters long

Must be at most 2083 characters long

HTTP Timeout

Type: number Default: 10

The timeout in seconds for HTTP requests.

Index Name

Default: null

The name of the index to run the query against.

Max Concurrency

Type: integer Default: 10

The maximum number of concurrent queries to run. Defaults to 10.

Query Engine

Type: string

The query engine to use.

Type

Type: const Default: "v2"
Specific value: "v2"

Use Async

Type: boolean Default: true

Whether to use the async query flow. This is only applicable when querying against a remote server. Local queries are async by default. Defaults to true.

Metadata

Type: object

The metadata for the root model

Description

Type: string Default: ""

The description of the test suite.

Name

Type: string

The name of the test suite.

Must be at least 1 characters long

Version

Type: string Default: "unknown"

Version of this test suite based on Kondo.

Test Cases

Type: array

The list of test cases

Must contain a minimum of 1 items

No Additional Items

Each item of this array must be:

Test Case

Type: object

Test cases are a single unit of test within a test suite.

Asserts

Type: array

The asserts for this test case.

Must contain a minimum of 1 items

No Additional Items

Each item of this array must be:


Response Metadata Assert

Type: object

Asserts that a response metadata has a certain value.

Path

Type: string

The JMES path to the metadata value that we want to assert.

Must be at least 1 characters long

Type

Type: const Default: "response_metadata"
Specific value: "response_metadata"

Query Elapsed Time Assert

Type: object

Assert that a query took less than a certain time.

Maximum

Default: null

The maximum amount of time that a query took. Defaults to None meaning there is no upper bound.

Minimum

Type: number Default: 0.0

The minimum amount of time that a query took. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "query_elapsed_time"
Specific value: "query_elapsed_time"

Message Type Assert

Type: object

Assert that a messages response has a certain type.

Any Order

Type: boolean Default: false

If True, the message types can be in any order.

Match All

Type: boolean Default: true

If True, all message types should be present.

Message Types

Type: array of string

The message types that we want to assert.

Must contain a minimum of 1 items

No Additional Items

Each item of this array must be:

Type

Type: const Default: "message_type"
Specific value: "message_type"

Regexes Assert

Type: object

Assert that one or more regular expressions match.

Case Sensitive

Type: boolean Default: false

Whether the regexes should be case sensitive.

Is Ordered

Type: boolean Default: true

Whether the regexes should be matched in order within the message text. We assume regexes first followed by substrings. Defaults to true.

Minimum Should Match

Default: "all"

The minimum number of regexes or substrings that should match message text. If set to all, it must match all the regexes. Defaults to all.

Regexes

Type: array of string

The list of regexes for the assert.

No Additional Items

Each item of this array must be:

Substrings

Type: array of string

The list of substrings for the assert.

No Additional Items

Each item of this array must be:

Type

Type: const Default: "regexes"
Specific value: "regexes"

Inverse Regexes Assert

Type: object

Assert that one or more regular expressions don't match.

Case Sensitive

Type: boolean Default: false

Whether the regexes should be case sensitive.

Minimum Should Not Match

Default: "all"

The minimum number of regexes or substrings that should not match message text. If set to all, it must not match any of the regexes. Defaults to all.

Regexes

Type: array of string

The list of regexes for the assert.

No Additional Items

Each item of this array must be:

Substrings

Type: array of string

The list of substrings for the assert.

No Additional Items

Each item of this array must be:

Type

Type: const Default: "inverse_regexes"
Specific value: "inverse_regexes"

Regexes Match Count Assert

Type: object

Assert that the occurrences of regular expressions that match are in a certain range.

Case Sensitive

Type: boolean Default: false

Whether the regexes should be case sensitive.

Equals

Default: null

The number of occurrences of regexes or substrings that match message text should be equal to this. If this is set, minimum and maximum will be ignored. Defaults to None.

Maximum

Default: null

The maximum number of occurrences of regexes or substrings that match message text. Defaults to None meaning there is no upper bound.

Minimum

Type: integer Default: 1

The minimum number of occurrences of regexes or substrings that match message text. Defaults to 1.

Value must be greater or equal to 0

Regexes

Type: array of string

The list of regexes for the assert.

No Additional Items

Each item of this array must be:

Substrings

Type: array of string

The list of substrings for the assert.

No Additional Items

Each item of this array must be:

Type

Type: const Default: "regexes_count"
Specific value: "regexes_count"

Answer Relevancy Assert

Type: object

Implements AnswerRelevancy from Deepeval: https://docs.confident-ai.com/docs/metrics-answer-relevancy

Evaluation LLM settings

Type: object

Settings for the evaluation LLM to process LLM metrics.

Context Template

Type: string Default: "You are a friendly and warm question-answer hospital assistant, employed as an employee of the hospital.\n The only information you know is the context provided. You can use only that information as your hidden knowledge base.\n When you answer, the context provided will be known as \"my training.\"\n If the context provided provides links, show it to the user."

Template for the context

Guardrail Identifier

Type: string Default: "arn:aws:bedrock:us-west-2:394252546268:guardrail/f83z6kx1d6hl"

Name of AWS guardrail if applicable. Should start with arn:aws:bedrock:us-west-2:.

Guardrail Version

Type: string Default: "1"

AWS Guardrail version (string) if applicable

Max Tokens

Type: integer Default: 1000

Max tokens returned by LLM

Model

Type: string

Name of LLM as per HuggingFace

Must be at least 1 characters long

Reranker Similarity Cutoff

Type: number Default: 0

Similarity score cutoff for node postprocessing based on reranker score

Similarity Cutoff

Type: number Default: 0.38

Similarity score cutoff for node postprocessing based on embedding score

Similarity top K

Type: integer Default: 6

Number of nodes to return after retrieval

Supported language codes

Type: array of string

Restricts bot to only answer only these languages. Provide list of 2-letter language codes, and double-check that Amazon Comprehend / Translate supports them.

No Additional Items

Each item of this array must be:

System Prompt

Type: string Default: "Answer the QUERY below only using the DOCUMENTs below as context, and not your trained knowledge."

Prompt for the LLM

Temperature

Type: number Default: 0

Temperature

Maximum

Default: null

The maximum metric score required. Defaults to None meaning there is no upper bound.

Metric Keyword args

Type: object

Additional metric keyword arguments that you supply to the metric class

Additional Properties of any type are allowed.

Type: object

Minimum

Type: number Default: 0.0

The minimum metric score required. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "answer_relevancy"
Specific value: "answer_relevancy"

Contextual Precision Assert

Type: object

Implements ContextualPrecision from Deepeval: https://docs.confident-ai.com/docs/metrics-contextual-precision

Expected Output

Type: string

The expected output for the query which we will evaluate against.

Must be at least 1 characters long

Evaluation LLM settings

Type: object

Settings for the evaluation LLM to process LLM metrics.

Context Template

Type: string Default: "You are a friendly and warm question-answer hospital assistant, employed as an employee of the hospital.\n The only information you know is the context provided. You can use only that information as your hidden knowledge base.\n When you answer, the context provided will be known as \"my training.\"\n If the context provided provides links, show it to the user."

Template for the context

Guardrail Identifier

Type: string Default: "arn:aws:bedrock:us-west-2:394252546268:guardrail/f83z6kx1d6hl"

Name of AWS guardrail if applicable. Should start with arn:aws:bedrock:us-west-2:.

Guardrail Version

Type: string Default: "1"

AWS Guardrail version (string) if applicable

Max Tokens

Type: integer Default: 1000

Max tokens returned by LLM

Model

Type: string

Name of LLM as per HuggingFace

Must be at least 1 characters long

Reranker Similarity Cutoff

Type: number Default: 0

Similarity score cutoff for node postprocessing based on reranker score

Similarity Cutoff

Type: number Default: 0.38

Similarity score cutoff for node postprocessing based on embedding score

Similarity top K

Type: integer Default: 6

Number of nodes to return after retrieval

Supported language codes

Type: array of string

Restricts bot to only answer only these languages. Provide list of 2-letter language codes, and double-check that Amazon Comprehend / Translate supports them.

No Additional Items

Each item of this array must be:

System Prompt

Type: string Default: "Answer the QUERY below only using the DOCUMENTs below as context, and not your trained knowledge."

Prompt for the LLM

Temperature

Type: number Default: 0

Temperature

Maximum

Default: null

The maximum metric score required. Defaults to None meaning there is no upper bound.

Metric Keyword args

Type: object

Additional metric keyword arguments that you supply to the metric class

Additional Properties of any type are allowed.

Type: object

Minimum

Type: number Default: 0.0

The minimum metric score required. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "contextual_precision"
Specific value: "contextual_precision"

Contextual Recall Assert

Type: object

Implements ContextualRecall from Deepeval: https://docs.confident-ai.com/docs/metrics-contextual-recall

Expected Output

Type: string

The expected output for the query which we will evaluate against.

Must be at least 1 characters long

Evaluation LLM settings

Type: object

Settings for the evaluation LLM to process LLM metrics.

Context Template

Type: string Default: "You are a friendly and warm question-answer hospital assistant, employed as an employee of the hospital.\n The only information you know is the context provided. You can use only that information as your hidden knowledge base.\n When you answer, the context provided will be known as \"my training.\"\n If the context provided provides links, show it to the user."

Template for the context

Guardrail Identifier

Type: string Default: "arn:aws:bedrock:us-west-2:394252546268:guardrail/f83z6kx1d6hl"

Name of AWS guardrail if applicable. Should start with arn:aws:bedrock:us-west-2:.

Guardrail Version

Type: string Default: "1"

AWS Guardrail version (string) if applicable

Max Tokens

Type: integer Default: 1000

Max tokens returned by LLM

Model

Type: string

Name of LLM as per HuggingFace

Must be at least 1 characters long

Reranker Similarity Cutoff

Type: number Default: 0

Similarity score cutoff for node postprocessing based on reranker score

Similarity Cutoff

Type: number Default: 0.38

Similarity score cutoff for node postprocessing based on embedding score

Similarity top K

Type: integer Default: 6

Number of nodes to return after retrieval

Supported language codes

Type: array of string

Restricts bot to only answer only these languages. Provide list of 2-letter language codes, and double-check that Amazon Comprehend / Translate supports them.

No Additional Items

Each item of this array must be:

System Prompt

Type: string Default: "Answer the QUERY below only using the DOCUMENTs below as context, and not your trained knowledge."

Prompt for the LLM

Temperature

Type: number Default: 0

Temperature

Maximum

Default: null

The maximum metric score required. Defaults to None meaning there is no upper bound.

Metric Keyword args

Type: object

Additional metric keyword arguments that you supply to the metric class

Additional Properties of any type are allowed.

Type: object

Minimum

Type: number Default: 0.0

The minimum metric score required. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "contextual_recall"
Specific value: "contextual_recall"

Contextual Relevancy Assert

Type: object

Implements ContextualRelevancy from Deepeval: https://docs.confident-ai.com/docs/metrics-contextual-relevancy

Evaluation LLM settings

Type: object

Settings for the evaluation LLM to process LLM metrics.

Context Template

Type: string Default: "You are a friendly and warm question-answer hospital assistant, employed as an employee of the hospital.\n The only information you know is the context provided. You can use only that information as your hidden knowledge base.\n When you answer, the context provided will be known as \"my training.\"\n If the context provided provides links, show it to the user."

Template for the context

Guardrail Identifier

Type: string Default: "arn:aws:bedrock:us-west-2:394252546268:guardrail/f83z6kx1d6hl"

Name of AWS guardrail if applicable. Should start with arn:aws:bedrock:us-west-2:.

Guardrail Version

Type: string Default: "1"

AWS Guardrail version (string) if applicable

Max Tokens

Type: integer Default: 1000

Max tokens returned by LLM

Model

Type: string

Name of LLM as per HuggingFace

Must be at least 1 characters long

Reranker Similarity Cutoff

Type: number Default: 0

Similarity score cutoff for node postprocessing based on reranker score

Similarity Cutoff

Type: number Default: 0.38

Similarity score cutoff for node postprocessing based on embedding score

Similarity top K

Type: integer Default: 6

Number of nodes to return after retrieval

Supported language codes

Type: array of string

Restricts bot to only answer only these languages. Provide list of 2-letter language codes, and double-check that Amazon Comprehend / Translate supports them.

No Additional Items

Each item of this array must be:

System Prompt

Type: string Default: "Answer the QUERY below only using the DOCUMENTs below as context, and not your trained knowledge."

Prompt for the LLM

Temperature

Type: number Default: 0

Temperature

Maximum

Default: null

The maximum metric score required. Defaults to None meaning there is no upper bound.

Metric Keyword args

Type: object

Additional metric keyword arguments that you supply to the metric class

Additional Properties of any type are allowed.

Type: object

Minimum

Type: number Default: 0.0

The minimum metric score required. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "contextual_relevancy"
Specific value: "contextual_relevancy"

Correctness Assert

Type: object

Implements Correctness from Deepeval: https://docs.confident-ai.com/docs/guides-answer-correctness-metric#:~:text=Answer%20Correctness%20(or%20Correctness)%20is,0%20indicating%20an%20incorrect%20one.

Expected Output

Type: string

The expected output for the query which we will evaluate against.

Must be at least 1 characters long

Evaluation LLM settings

Type: object

Settings for the evaluation LLM to process LLM metrics.

Context Template

Type: string Default: "You are a friendly and warm question-answer hospital assistant, employed as an employee of the hospital.\n The only information you know is the context provided. You can use only that information as your hidden knowledge base.\n When you answer, the context provided will be known as \"my training.\"\n If the context provided provides links, show it to the user."

Template for the context

Guardrail Identifier

Type: string Default: "arn:aws:bedrock:us-west-2:394252546268:guardrail/f83z6kx1d6hl"

Name of AWS guardrail if applicable. Should start with arn:aws:bedrock:us-west-2:.

Guardrail Version

Type: string Default: "1"

AWS Guardrail version (string) if applicable

Max Tokens

Type: integer Default: 1000

Max tokens returned by LLM

Model

Type: string

Name of LLM as per HuggingFace

Must be at least 1 characters long

Reranker Similarity Cutoff

Type: number Default: 0

Similarity score cutoff for node postprocessing based on reranker score

Similarity Cutoff

Type: number Default: 0.38

Similarity score cutoff for node postprocessing based on embedding score

Similarity top K

Type: integer Default: 6

Number of nodes to return after retrieval

Supported language codes

Type: array of string

Restricts bot to only answer only these languages. Provide list of 2-letter language codes, and double-check that Amazon Comprehend / Translate supports them.

No Additional Items

Each item of this array must be:

System Prompt

Type: string Default: "Answer the QUERY below only using the DOCUMENTs below as context, and not your trained knowledge."

Prompt for the LLM

Temperature

Type: number Default: 0

Temperature

Maximum

Default: null

The maximum metric score required. Defaults to None meaning there is no upper bound.

Metric Keyword args

Type: object

Additional metric keyword arguments that you supply to the metric class

Additional Properties of any type are allowed.

Type: object

Minimum

Type: number Default: 0.0

The minimum metric score required. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "correctness"
Specific value: "correctness"

Faithfulness Assert

Type: object

Implements Faithfulness from Deepeval: https://docs.confident-ai.com/docs/metrics-faithfulness

Evaluation LLM settings

Type: object

Settings for the evaluation LLM to process LLM metrics.

Context Template

Type: string Default: "You are a friendly and warm question-answer hospital assistant, employed as an employee of the hospital.\n The only information you know is the context provided. You can use only that information as your hidden knowledge base.\n When you answer, the context provided will be known as \"my training.\"\n If the context provided provides links, show it to the user."

Template for the context

Guardrail Identifier

Type: string Default: "arn:aws:bedrock:us-west-2:394252546268:guardrail/f83z6kx1d6hl"

Name of AWS guardrail if applicable. Should start with arn:aws:bedrock:us-west-2:.

Guardrail Version

Type: string Default: "1"

AWS Guardrail version (string) if applicable

Max Tokens

Type: integer Default: 1000

Max tokens returned by LLM

Model

Type: string

Name of LLM as per HuggingFace

Must be at least 1 characters long

Reranker Similarity Cutoff

Type: number Default: 0

Similarity score cutoff for node postprocessing based on reranker score

Similarity Cutoff

Type: number Default: 0.38

Similarity score cutoff for node postprocessing based on embedding score

Similarity top K

Type: integer Default: 6

Number of nodes to return after retrieval

Supported language codes

Type: array of string

Restricts bot to only answer only these languages. Provide list of 2-letter language codes, and double-check that Amazon Comprehend / Translate supports them.

No Additional Items

Each item of this array must be:

System Prompt

Type: string Default: "Answer the QUERY below only using the DOCUMENTs below as context, and not your trained knowledge."

Prompt for the LLM

Temperature

Type: number Default: 0

Temperature

Maximum

Default: null

The maximum metric score required. Defaults to None meaning there is no upper bound.

Metric Keyword args

Type: object

Additional metric keyword arguments that you supply to the metric class

Additional Properties of any type are allowed.

Type: object

Minimum

Type: number Default: 0.0

The minimum metric score required. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "faithfulness"
Specific value: "faithfulness"

Hallucination Assert

Type: object

Implements Hallucination from Deepeval: https://docs.confident-ai.com/docs/metrics-hallucination

Context

Type: array of string

The context to evaluate hallucination against.

Must contain a minimum of 1 items

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long

Evaluation LLM settings

Type: object

Settings for the evaluation LLM to process LLM metrics.

Context Template

Type: string Default: "You are a friendly and warm question-answer hospital assistant, employed as an employee of the hospital.\n The only information you know is the context provided. You can use only that information as your hidden knowledge base.\n When you answer, the context provided will be known as \"my training.\"\n If the context provided provides links, show it to the user."

Template for the context

Guardrail Identifier

Type: string Default: "arn:aws:bedrock:us-west-2:394252546268:guardrail/f83z6kx1d6hl"

Name of AWS guardrail if applicable. Should start with arn:aws:bedrock:us-west-2:.

Guardrail Version

Type: string Default: "1"

AWS Guardrail version (string) if applicable

Max Tokens

Type: integer Default: 1000

Max tokens returned by LLM

Model

Type: string

Name of LLM as per HuggingFace

Must be at least 1 characters long

Reranker Similarity Cutoff

Type: number Default: 0

Similarity score cutoff for node postprocessing based on reranker score

Similarity Cutoff

Type: number Default: 0.38

Similarity score cutoff for node postprocessing based on embedding score

Similarity top K

Type: integer Default: 6

Number of nodes to return after retrieval

Supported language codes

Type: array of string

Restricts bot to only answer only these languages. Provide list of 2-letter language codes, and double-check that Amazon Comprehend / Translate supports them.

No Additional Items

Each item of this array must be:

System Prompt

Type: string Default: "Answer the QUERY below only using the DOCUMENTs below as context, and not your trained knowledge."

Prompt for the LLM

Temperature

Type: number Default: 0

Temperature

Maximum

Default: null

The maximum metric score required. Defaults to None meaning there is no upper bound.

Metric Keyword args

Type: object

Additional metric keyword arguments that you supply to the metric class

Additional Properties of any type are allowed.

Type: object

Minimum

Type: number Default: 0.0

The minimum metric score required. Defaults to 0.

Value must be greater or equal to 0

Type

Type: const Default: "hallucination"
Specific value: "hallucination"

Description

Default: null

The description of the test case. This can be used to describe the motivation for the test case.

Name

Default: null

The name of the test case. Defaults to the query.

Query

Type: object

The query for the test case, which includes actual message and user profile.

Profile


The profile of the user executing the query. If specified as a string, it will attempt to load the predefined profiles.

Jarvis V1 Profile

Type: object

Profile for Jarvis V1.

Department

Type: string Default: ""

The department of the profile user

Designation

Type: string Default: ""

The designation of the profile user

Hospital

Type: string

The organization key associated with the hospital profile.

Must be at least 1 characters long

Tags

Type: array of string

The tags associated with the profile.

No Additional Items

Each item of this array must be:

UID

Type: string Default: "undefined"

The unique identifier for the profile.

Must be at least 1 characters long

JarvisProfile

Type: object

Profile for Jarvis V2.

Hospital

Default: null

Profile metadata associated with a hospital user.

Tenant


The tenant of this profile.

Cleo Tenant

Type: string

The cleo app tenant applicable to this object.

Must match regular expression: ^cleo\:[a-zA-Z0-9][\w\-\_]*$

Hospital Tenant

Type: string

The hospital app tenant applicable to this object.

Must match regular expression: ^hospital\:[a-zA-Z0-9][\w\-\_]*$

Session Options

Default: null

Additional Options regarding the session related changes, like overriding certain values, etc.

QuerySessionOptions

Type: object

CleoQuerySessionOptions

Type: object

Device Uid

Type: array of string
No Additional Items

Each item of this array must be:

User Uid

Type: array of string
No Additional Items

Each item of this array must be:

EinsteinQuerySessionOptions

Type: object

Llm Search Mode

Type: enum (of string)

Must be one of:

  • "retrieval"
  • "search_summarize"

Query Text

Type: string

The text of the query.

Must be at least 1 characters long

Test Case ID

Default: null

The test case ID. Note that this is usually set during loading of the test suite.

Tenant

Default: null

The tenant for which the docs come from/are to be answered.