ParseJSON
New to Analytica 5.0
This function requires the Analytica Enterprise edition or higher (e.g., Analytica Optimizer, ADE or CubePlan).
ParseJSON( json, objectSchema..., flags )
Parses JSON text in «json» to generate corresponding Analytica data and arrays. Usually, you will obtain the «json» text from a call to ReadTextFile or ReadFromUrl. It works without an «objectSchema». But you can specify an «objectSchema» to help map the structure of the data using indexes in your model.
JavaScript Object Notation (JSON) is a widely used lightweight data-interchange format. It is easy for humans and machines to read and write.
Parameters
- «json»: A JSON-formatted text to parse.
- «objectSchema»: A schema describing the JSON class structure and mapping it into Analytica index(es). See Parsing with a schema. If given no «objectSchema», see Parsing without a schema below.
- «flags»: (optional) A bit field of flags that control various aspects of parsing. Bit settings are
- 1 = During schema-free parsing, create local indexes
.Dim1
,.Dim2
, ... for arrays. Without this, each level of an array is returned as a reference to a list.
- 1 = During schema-free parsing, create local indexes
Parsing without a schema
When the «objectSchema» parameter is not specified,
ParseJSON( json )
parses the «json» data without using a schema.
Reading JSON objects
The top-level item in a JSON document should be a JSON-object. Here is an example of a JSON object.
Variable json1 Definition: '{ "title" : "1984", "author" : "George Orwell", "year" : 1949, "pages" : 336, "paperback" : true }'
ParseJSON(json1)
→
Notice that a local index named .Member
is automatically created. The 'title'
is text, the 'year'
is a number.
- Variable parse1a :=
ParseJSON(json1)
TypeOf(parse1a[.Member='title']) → 'Text'
TypeOf(parse1a[.Member='year']) → 'Number'
When there are nested objects, local indexes are created at each level. To prevent the indexes from combining into a rectangular array, the member object is placed within a reference.
Variable json2 Definition: '{ "title" : "1984", "author" : { "first" : "George", "last" : "Orwell" }, "year" : 1949, "pages" : 336, "paperback" : true }'
- Variable parse2 :=
ParseJSON(json2)
parse2
→#parse2[.Member='author']
→
Reading JSON arrays
Without using a schema, ParseJSON does not map array data to existing indexes that you might have. Without a schema, you have two options. In the first option, it will return arrays as lists and nested lists. In Analytica, to nest a list (and avoid having the implicit dimension combine with other indexes), the list is placed inside a reference. The first option is used unless the 1 bit of «flags» is set. The second option is to have ParseJSON created local indexes automatically, which are named .Dim1
, .Dim2
, etc., corresponding to the nesting level in «json». The second option produces a multi-dimensional array without nesting.
Variable json3 :='{ "data" : [ [ 1,2], [3,4], [5,6] ] }'
Variable parse3 :=ParseJSON(json3)
Variable parse3b := ParseJSON(json3, flags:1)
Parsing with a schema
A schema describes the data structure of a Java Script object, and specifies the index in your model that encodes the object.
JSON object schemas
The class structure for a JavaScript object is described an a 1-D array, where the index contains the member names, and the cell values describe the nested structure. For example, consider the earlier json2
data:
'{ "title" : "1984", "author" : { "first" : "George", "last" : "Orwell" }, "year" : 1949, "pages" : 336, "paperback" : true }'
This has a top-level JavaScript object (Book) and a nested JavaScript object (PersonName). We can encode these two «objectSchema» as follows.
- Index Book :=
['title', 'author', 'year', 'pages', 'paperback']
- Index PersonName :=
['first', 'last']
- Variable BookSchema :=
Table(Book)('atom',Handle(PersonName),'atom','atom','atom')
- Variable NameSchema :=
Array(PersonName,'atom')
- Variable parse4 :=
ParseJSON(json2, BookSchema, NameSchema )
parse4
→#parse4[Book='author']
→
Notice that the existing indexes (Book
and PersonName
) are used, rather than local indexes created by the function.
When specifying multiple object schema, the first «objectSchema» listed must be the top-level object.
The labels in the index must match the JSON object's member names exactly (case-sensitive), but the ordering of your index labels does not need to match the order in which the member values appear in the JSON. You can include extra member names in your index, but there every member that appears in the «json» data must appear in your index or an error will issue.
Member schema options
The following options can be used in a cell of an «objectSchema», each describing what is expected for the value of the corresponding member.
'atom'
; The text 'atom' specifies that the data for that member shall not be an object or an array. It can be text (surrounded by double quotes), a number, or the keywords:null
,true
orfalse
.Null
: Any valid «json» is allowed, and the json appearing for that element is parsed without a schema.Handle(index)
: A handle to an index specifies that a JSON-object is expected, with member names that match the elements of «index». If a schema for that index appears in «objectSchema», that that schema guides the parsing. The result for this member will be a reference to a 1-D array indexed by «index».\ListOfHandles(I,J,K)
: A reference to a list of handles to indexes specifies that a JSON-array is expected here, and the indexes specify the indexes for the result, and the index order. The first index (i.e., «I») corresponds to the outermost index in the JSON array. When 2 or more indexes are listed, the final index can be either an array index or an object index. An object index is used when the «json» contains an array of objects.
Reading arrays with schema
The JSON standard expects the outermost object to be an object, so parsing with a schema always starts with the first «objectSchema». When a member contains an array, then the member schema should be a reference to a list of index handles. The indexes in that list specify the indexes for the resulting array.
- Variable json3 :=
'{ "data" : [ [ 1,2], [3,4], [5,6] ] }'
- Index
J
:=[1, 2, 3]
- Index
K
:=['k1', 'k2']
- Index D :=
['Data']
- Variable D_Schema :=
Table(D)( \ListOfHandles( J,K ) )
In the next example, the JSON contains an array of books, so that each item in the JSON-array is an object.
Variable json5 Definition: '{ "bibliography" : [ { "title" : "1984", "author" : { "first" : "George", "last" : "Orwell" }, "year" : 1949, "pages" : 336, "paperback" : true }, { "title" : "The Time Machine", "author" : { "first" : "H. G.", "last" : "Wells" }, "year" : 1895, "pages" : 118, "paperback" : true } ] }'
The schema for BookSchema
has a reference to a list of handles, indicating that an array result is expected, and specifying the indexes for the result. The last index listed is an object with an «objectSchema» (i.e., Book
, where BookSchema
is provided), thus encoding that an array of book objects is expected.
- Index Biblio :=
['bibliography']
- Variable BookSchema :=
Table(Biblio)(\ListOfHandles(Book_Num, Book))
- Index Book_Num :=
1..2
- Variable parse5 :=
ParseJSON( json5, BiblioSchema, BookSchema, NameSchema )
#parse5[Biblio='bibliography']
→
In this example, had to know the number of books in advance. This is a limitation, in that your indexes that appear in the schema must have enough length to accommodate the data. It is acceptable to specify more entries than actually exist in the data, for example:
- Index Book_Num :=
1..1K
which has space for one thousand books, even though only two appear in the «json». In this case, the excess slices along Book_Num contain Null. When you cannot guarantee an index that is guaranteed to be long enough, then you will need to use schema-free parsing for that member (put a Null in that member's schema).
Enable comment auto-refresher