Data Source


Settings for configuring an Einstein data source.

No Additional Properties

Type: object

The following properties are required:

  • google_drive
Type: object

The following properties are required:

  • ingestor
Type: object

The following properties are required:

  • mock
Type: object

The following properties are required:

  • microsoft_sharepoint
Type: object

The following properties are required:

  • noop
Type: object

The following properties are required:

  • noop_dialogue
Type: object

The following properties are required:

  • s3
Type: object

The following properties are required:

  • tabular
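
The required-property lists above suggest that a data source configuration is identified by the source-type key it contains. A minimal sketch of that check follows. The helper name `source_kinds` and the example config are hypothetical, and this section does not state whether exactly one key or at least one key must be present.

```python
# Source-type keys taken from the required-property lists above.
SOURCE_KEYS = (
    "google_drive", "ingestor", "mock", "microsoft_sharepoint",
    "noop", "noop_dialogue", "s3", "tabular",
)


def source_kinds(config: dict) -> list:
    """Return the source-type keys present in a data source config."""
    return [key for key in SOURCE_KEYS if key in config]


# Hypothetical config describing a Noop source.
example = {"noop": {}}
kinds = source_kinds(example)
if not kinds:
    raise ValueError("config must contain one of: " + ", ".join(SOURCE_KEYS))
print(kinds)
```

A config with none of these keys would fail every alternative listed above, which is why the sketch raises in that case.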

Allow User Upload

Type: boolean

Whether this data source allows user upload. Defaults to false.

Convert To PDF

Type: boolean

Determines whether entries from this data source should be converted to PDF. Defaults to true.

Permanently delete deactivated entries.

Type: boolean

Boolean flag to indicate that deactivated entries should be deleted. Defaults to false.

Recreate all entries.

Type: boolean

Recreate all entries, skipping find and is_same in the syncer (faster for a large number of entries, e.g. a 50k directory). Defaults to false.

Type: object

Deletable

Type: boolean

Whether the UI should be able to delete entries in this DataSource. Defaults to true.

User Created

Type: boolean

Indicates if the entries in the DataSource are user created or preset. Defaults to true.

Generate file pages.json

Type: boolean

Determines whether a pages.json file, containing a mapping of text page number to text, will be generated and uploaded to storage. Defaults to false.

Method For PDF Export

Type: enum (of string)

The method used for converting office documents to PDF. Defaults to convertapi.

Must be one of:

  • "convertapi"
  • "libreoffice"

Chernobyl

Type: object

Settings related to Chernobyl module.

No Additional Properties

Key

Type: string

The corresponding Chernobyl MiniApp key in Jarvis.

Must be at least 1 characters long

Fields

Type: array

Settings for fields to be used when syncing.

Must contain a minimum of 1 items

All items must be unique

No Additional Items

Each item of this array must be:

Type: object
No Additional Properties

Key

Type: string

Uniquely identifiable key used internally for this field. It will be used as the key when storing values associated with this field.

Must be at least 1 characters long

Encrypt

Type: boolean

If this field should be encrypted in the database (defaults to false).

Synonyms File URL

Type: string

The URL of the synonyms file to use for synonyms transformation. Use only links to our own S3.

Must be at least 1 characters long

Eliza (Dialogues)

Type: object

Settings related to Eliza (Dialogues) module.

No Additional Properties

Key

Type: string

The corresponding Eliza provider key in Jarvis.

Must be at least 1 characters long

Rosters

Type: object

Settings related to Rosters module.

No Additional Properties

Key

Type: string

The corresponding Rosters Provider key in Jarvis.

Must be at least 1 characters long

Synonyms

Type: string

The S3 URI of a synonyms YAML file used by the roster parser (the default is used if not provided).

Must be at least 1 characters long

Fields

Type: array

Settings for fields to be used when syncing.

Must contain a minimum of 1 items

All items must be unique

No Additional Items

Each item of this array must be:

Use Fake Dates

Type: boolean

Tells the DataSource to generate fake dates (today) for entries.

SingHealth

Type: object

Settings related to SingHealth data sources.

Institutions

Type: array of string

The SingHealth institution names, as they appear in the column, to filter the rows by.

Must contain a minimum of 1 items

All items must be unique

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long

Google Drive

Type: object

Settings for using Google Drive as a data source.

No Additional Properties

Google Cloud Credentials

Type: object

The service account credentials.

Type

Type: const

Credential type. Must be service_account.

Specific value: "service_account"

Private Key

Type: string

Private key associated with the credential.

Must be at least 1 characters long

Additional Properties of any type are allowed.

Type: object

Google Drive URL

Type: string

Format: uri

The unique URL of the Google Drive or folder.

Must be at least 1 characters long

Ingestor


Settings for using an ingestor as a data source.

Type: object

The following properties are required:

  • content_uid
Type: object

The following properties are required:

  • worker_keys
Type: object

The following properties are required:

  • content_uri
  • worker_key
Type: object

The following properties are required:

  • content_uris
  • worker_key
Type: object

The following properties are required:

  • content_uri__startswith
  • worker_key
Type: object

The following properties are required:

  • content_uri__regex
  • worker_key
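
The alternatives above can be read as a set of allowed key combinations for an ingestor block. The sketch below reports which combinations a given block satisfies. The function name `matching_key_sets` and the example values are hypothetical, and the sketch checks only key presence, not the value constraints described for each property.

```python
# Allowed combinations of required keys, as listed above.
INGESTOR_KEY_SETS = [
    {"content_uid"},
    {"worker_keys"},
    {"content_uri", "worker_key"},
    {"content_uris", "worker_key"},
    {"content_uri__startswith", "worker_key"},
    {"content_uri__regex", "worker_key"},
]


def matching_key_sets(ingestor: dict) -> list:
    """Return every required-key combination that the block satisfies."""
    present = set(ingestor)
    return [keys for keys in INGESTOR_KEY_SETS if keys <= present]


# Hypothetical block: select content by URI and worker key.
block = {"content_uri": "reports/2020.xlsx", "worker_key": "example-worker"}
if not matching_key_sets(block):
    raise ValueError("ingestor block matches none of the allowed combinations")
```

Note that `worker_key` on its own matches nothing: every combination that uses it also needs one of the content URI selectors.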

Content UID

Type: string

UID of the IngestorContent to sync. The UID may change when the ingestor is wiped, so use content_uri and worker_key instead.

Must be at least 1 characters long

Content URI

Type: string

URI of the IngestorContent to sync (use with worker_key).

Must be at least 1 characters long

Content URI List

Type: array of string

List of URIs of the IngestorContent to sync (use with worker_key).

Must contain a minimum of 1 items

All items must be unique

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long

Worker Key

Type: string

Worker key of the IngestorContent to sync (use with content_uri).

Must be at least 1 characters long

URI Starts With

Type: string

Prefix for the IngestorContent content URI.

Must be at least 1 characters long

URI Match for Regex

Type: string

Regex filter for IngestorContent content URI.

Must be at least 1 characters long

Worker Keys

Type: array of string

The worker keys of IngestorContent to sync.

Must contain a minimum of 1 items

All items must be unique

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long

Active Excel Sheets

Type: array of string

The sheet names that should be synced.

Must contain a minimum of 1 items

All items must be unique

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long

Start From Row

Type: integer

The Excel row to start parsing from (the first row parsed should be the column header). Defaults to 1.

Mock

Type: object

Settings for the Mock data source. This is used in unit tests and should not be used in real life.

Microsoft Sharepoint

Type: object

Settings for using Microsoft Sharepoint as a data source.

No Additional Properties

Microsoft Sharepoint Credentials

Type: object

Sharepoint client secret

grant_type

Type: const

Grant permission type. Must be client_credentials

Specific value: "client_credentials"

client_secret

Type: string

Client secret generated from MS Azure AD.

Must be at least 1 characters long

scope

Type: const

Scope of the token generated. Must be https://graph.microsoft.com/.default

Specific value: "https://graph.microsoft.com/.default"

client_id

Type: string

client_id generated from MS Azure AD.

Must be at least 1 characters long

Tenant id

Type: string

Tenant id of the app.

Additional Properties of any type are allowed.

Type: object

Sharepoint Drive url

Type: string

Format: uri

URL of the Sharepoint drive that contains all the files.

Must be at least 1 characters long
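
As an illustration of the credential constraints above, the sketch below checks the two constant-valued properties and the non-empty string properties. The function name `credential_problems` is hypothetical. Because this section does not say which credential properties are required, each check applies only when the key is present.

```python
GRAPH_SCOPE = "https://graph.microsoft.com/.default"


def credential_problems(creds: dict) -> list:
    """Return a list of human-readable problems; empty means none found."""
    problems = []
    # Constant-valued properties are only checked when present.
    if "grant_type" in creds and creds["grant_type"] != "client_credentials":
        problems.append("grant_type must be 'client_credentials'")
    if "scope" in creds and creds["scope"] != GRAPH_SCOPE:
        problems.append("scope must be '" + GRAPH_SCOPE + "'")
    # String properties must be at least 1 character long.
    for key in ("client_id", "client_secret"):
        if key in creds and not (isinstance(creds[key], str) and creds[key]):
            problems.append(key + " must be a non-empty string")
    return problems


# Placeholder values only; real secrets should never be hard-coded.
creds = {
    "grant_type": "client_credentials",
    "scope": GRAPH_SCOPE,
    "client_id": "example-client-id",
    "client_secret": "example-secret",
}
print(credential_problems(creds))
```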

Noop

Type: object

Settings for using a Noop data source. This is useful as a dummy data source for user uploaded entries.

Noop Dialogue

Type: object

Settings for using a Noop dialogue data source. This is useful as a dummy data source for user uploaded entries.

S3


Settings for using S3 as a data source. Access control depends on the permissions assigned to the Hippocrates task roles: staging-hippo-ecs_task_role-20180611204658589300000001 and product-hippo-ecs_task_role-20180620231135019400000002.

No Additional Properties

Type: object

The following properties are required:

  • uri
Type: object

The following properties are required:

  • uris

URI

Type: string

Format: uri

The s3:// URI to the bucket along with key prefix. A trailing slash is always added.

Must be at least 1 characters long

URIs

Type: array of string

The s3:// URIs to the buckets along with key prefix. A trailing slash is always added.

Must contain a minimum of 1 items

All items must be unique

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long
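
Both the URI and URIs descriptions note that a trailing slash is always added. A minimal sketch of that normalization follows. The function name `with_trailing_slash` is hypothetical, and the sketch assumes the slash is appended only when it is missing, which this section does not confirm.

```python
def with_trailing_slash(uri: str) -> str:
    """Return the s3:// URI with exactly the trailing slash it needs."""
    if not uri.startswith("s3://"):
        raise ValueError("expected an s3:// URI, got: " + uri)
    # Assumption: an existing trailing slash is left alone, not doubled.
    return uri if uri.endswith("/") else uri + "/"


# Hypothetical bucket and prefixes.
uris = ["s3://example-bucket/reports", "s3://example-bucket/scans/"]
print([with_trailing_slash(u) for u in uris])
```

One practical consequence is that a key prefix such as `reports` is treated as a folder-like prefix (`reports/`), so it would not also match keys like `reports-old/...`.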

Tabular interface

Type: object

Settings for using a tabular interface as a data source.

Type: object

The following properties are required:

  • url
Type: object

The following properties are required:

  • urls

URL

Type: string

The URL containing tabular data.

Must be at least 1 characters long

URLs

Type: array of string

URLs containing tabular data (for use when data is split into multiple tables).

Must contain a minimum of 1 items

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long

Sheet Name

Type: string

Name of the sheet in a workbook

Must be at least 1 characters long

Status Column

Type: string

The name of the column to use as the Status column. Defaults to Status.

If any of development, staging, production, test appears in the status column, the row will be ignored unless it is executed in the same environment (as denoted by settings.ENVIRONMENT).

If ignore appears in the status column, the row is ignored.

Must be at least 1 characters long
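
The status rules above can be sketched as a single predicate. The function name `keep_row` is hypothetical, and the sketch assumes the status cell is compared as a whole value after trimming and lowercasing. The source only says the value "appears in" the column, so the real matching may differ.

```python
ENVIRONMENTS = {"development", "staging", "production", "test"}


def keep_row(status: str, environment: str) -> bool:
    """Decide whether a row is kept, given its status cell and the
    current environment (as denoted by settings.ENVIRONMENT)."""
    value = status.strip().lower()
    if value == "ignore":
        return False
    if value in ENVIRONMENTS:
        # Environment-tagged rows are kept only in that environment.
        return value == environment
    # Any other status (including empty) does not filter the row.
    return True


print(keep_row("staging", "production"))
```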

Headers

Type: array of string

An array of strings that denote the headers for the TabularInterface. Useful if the source data does not define headers.

Must contain a minimum of 1 items

No Additional Items

Each item of this array must be:

Type: string

Must be at least 1 characters long

Skip Header Row

Type: boolean

Whether to skip the header row.

This is a misnomer. It refers to whether the source data contains a header row.

When set to true, every row of the source data is read.

When set to false (default behavior), the first row of the data is treated as the headers value and data starts from the 2nd row onwards.
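
Because the option name is easy to misread, a small sketch of the described behavior may help. The function name `split_headers` is hypothetical, as is the assumption that the separately configured Headers value supplies the headers when every row is read as data.

```python
def split_headers(rows, skip_header_row=False, headers=None):
    """Return (headers, data_rows) following the behavior described above."""
    rows = list(rows)
    if skip_header_row:
        # True: every row of the source data is read as data.
        return headers, rows
    if not rows:
        return headers, []
    # False (default): the first row is the headers, data starts at row 2.
    return rows[0], rows[1:]


table = [["name", "status"], ["alpha", "staging"], ["beta", "ignore"]]
found_headers, data = split_headers(table)
print(found_headers, len(data))
```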

Each additional property must conform to the following schema

Type: object

Additional properties will be passed on to the __init__ for the TabularInterface reader.