ReadBinaryFile
Release: |
4.6 • 5.0 • 5.1 • 5.2 • 5.3 • 5.4 • 6.0 • 6.1 • 6.2 • 6.3 • 6.4 • 6.5 |
---|
Requires Analytica 5.0 Enterprise or higher
ReadBinaryFile(filename, bytesPer, typeFlags, offset, resultIndex..., showDialog, title)
Reads data directly from a binary (i.e., non-text) file.
Flat data files are usually read using ReadTextFile. When you read text files, end-of-line characters are interpreted in various ways for you, conversions from character sets such as UTF-8 or Unicode may occur, and certain special byte values such as the zero byte or end-of-file cannot occur in the file. In contrast, reading from a binary file gives you access to the bytes of the file exactly as they appear.
ReadBinaryFile is most commonly used when you have a data file in a binary data format from another application, and need to access the internal bytes at particular locations. It can also be used in conjunction with WriteBinaryFile as a fast and convenient method to dump an Analytica array to a file and load it back in later.
The function GetFromBinaryData operates identically to ReadBinaryFile, except that it reads from an in-memory binary data term rather than from a file. If you plan to run your model runs in ACP over the web, it is important to read the file into memory with one call, and then use GetFromBinaryData to read each item so that the user is asked to upload the file only once.
Result Value(s)
These values are returned:
- The contents read.
- The full file path
- The offset where it left off reading. Useful if you want to read a different data structure that starts afterwards. (new to Analytica 6.4).
The main result returned is the contents it reads. This will either be an array indexed by the index(es) you specify in the «resultIndex» parameter, or a list of items if you omit the «resultIndex» parameter.
To capture one or both of the other result values, use:
Local ( content, path, nextOffset ) := ReadBinaryFile(....)
Reading bytes
The usage
ReadBinaryFile(filename)
returns the contents of the file as bytes, a list of integer values between 0 and 255.
Reading N values
To limit the number of items read, pass an index to the «resultIndex» parameter having a length equal to the maximum number of items to read. The actual number read will be less if the end-of-file is reached. For example,
ReadBinaryFile(filename, resultIndex: I)
Parameters
filename
Name of file to read from.
File name paths can be either relative or absolute. An absolute path specifies the full path of the file from a root drive, such as
"C:\Users\Drice\Documents\Projects\FatigueAnalysis\Results.bin"
A relative filename incompletely specifies a file path, and is interpreted relative to the CurrentDataDirectory.
I...
The index(es) that you want as index(es) of the result. If you provide no index, it returns an unindexed list, so the variable calling ReadBinaryFile will appear as the index. If you provide multiple indexes, the last index listed vary fastest and first varies slowest as items are read.
The number of bytes to read depends on the product of the sizes of the indexes and the number of bytes per cell, which could vary if Bytesper parameter varies by one or more of the Indexes. If the total is less than the number of bytes in the file, it will read only the amount of data needed. If the total is more than the file size, it will pad out the excess cells with NULL.
If the indexes are in a different order in the file from the canonical order within Analytica, it reforms (transposes) the array to the internal form needed. If the amount of data is very large file (e.g. file has hundreds of MB), this reform process can be quite time-consuming.
bytesPer
Specifies the number of bytes per item read. For example, when you specify bytesPer: 4
, then each 4 bytes becomes one cell of the result. When this is 1, then each byte is read and returned as a separate cell. Using 8 for «bytesPer» will result in the most memory-efficient encoding of contents when it is read into memory, since there is 8 times as much information per cell as would be the case for single bytes (and the bytes required per cell in memory would be the same in both cases).
When reading integer values (as is the default when «typeFlags» is omitted), this specifies the number of bytes per integer word, and can be 1 (for single bytes), 2 (for 16-bit words), 4 (for 32-bit integers) or 8 (for 64-bit integers).
When reading floating point real numbers, «bytesPer» can be 4 (for 32-bit IEEE 754 reals, often called floats), or 8 (for 64-bit IEEE 754 reals, often called doubles). When «typeFlags» has 2
(floating point), the default for «bytesPer» is 8 if not specified.
For the «typeFlags» option of 3
, nine bytes per item are used, so you can either omit «bytesPer» or set it to 9. Any other value throws an error.
For the «typeFlags» option of 4
, the number of bytes per item may vary (depending on the length of text values), and so «bytesPer» must be omitted or an error will be issued.
typeFlags
Controls how each item read gets converted into a value.
0
= Unsigned integer (positive only)1
= Signed integer (positive and negative numbers)2
= IEEE 754 Floating point real3
= Value (numbers, dates and null). 9 bytes per item.4
= Value (including textor binary data). Variable bytes per text item.7
= Raw binary data.8
= image32
= Big endian flag.
You can say the one-byte integer with hex value 0xc1 is equal to 193, or you can say that it equals -133. The distinction here is whether you interpret the raw binary data as representing an unsigned integer (193) or a signed integer (-133). Signed integers use two's-complement so that both negative and positive values are within the possible range. Use typeFlags: 0
when you want integers to be interpreted as unsigned, or 1
to include negative values. When «bytesPer» is 8, integers are read in as signed regardless.
A standard Intel convention writes numbers (integers or floating point values) with the least significant bytes written first, least significant bytes last, which is termed "little-endian format". This is the default assumed by WriteBinaryFile unless the typeFlags = 32
flag is specified. "Big endian" means that the most significant byte is written first.
You can add the 32
"big endian" option to the data type selection. So, for example, a value of 2 + 32
would specify a floating point format (2
) with the byte order reversed (32
).
The Value type is a variant data type, in which a leading byte encodes the internal data type of the value, and followed by an 8-byte value. The «typeFlags» option of 3
only writes Analytica data types that can be fully encoded in with 8 bytes, which includes integers, fixed-point reals, floating point reals, date and date-time numbers, and the special value Null. When using this type, Null values ARE written to the file. The «typeFlags» option of 3
cannot be used to write text values, since these vary in length. The cells will each use 9 bytes, and will be spaced every 9 bytes in the file.
The «typeFlags» option of 4
extends option 3
by allowing text valuesor binary data. When a text value occurs, the first byte indicates that the value is text, the next four bytes encode the length, n
, and the next n
bytes encodes the characters in UTF-8. When a binary data "blob" occurs, the first byte indicates the value in binary, the next 8 bytes specify the number of bytes, and then the next n
bytes are the data. When this format is used, the «bytesPer» parameter must be omitted. The location of items in the file are not spaced at any predictable interval, since text items vary in length.
The «typeFlags» option of 7 reads the next «bytesPer» bytes of the file into memory as a binary data term. When «bytesPer» is INF or omitted, the entire file is read. When «bytesPer» is specified and you haven't specified an index, it'll read a list of binary data terms of the given size. If you have specified and index, it'll read up to the index length binary data chunks. You can use this to read a binary file into memory, and then use GetFromBinaryData to perform multiple reads at different locations. This avoids extra overhead of opening and closing the file during each read. GetFromBinaryData has the same parameters as ReadBinaryFile, but reads from an in-memory binary term rather than from a binary file on disk.
offset
Specifies the file position to start reading from. Negative value specifies the distance (in bytes) from the end of the file, and a positive or zero value specifies the distance in bytes from the start of the file. You can use «offset» parameter to read data from a particular location inside the file.
When «offset» varies with the result index(es), you can use Null values in the «offset» array to indicate that the item resides directly after the previously read item. Hence, set to non-null integer values only for those items that start at a new offset.
It is likely you'll want to limit the number if items read to a maximum number. To do this, specify a result index. The length of the result index will be used as the maximum number of items read.
showDialog
This is a flag to force or suppress the file selector dialog. When not specified, the dialog only displays if needed, for example, if the file name is blank or the file doesn't exist. Setting «showDialog» to false suppresses the dialog from appearing. Setting «showDialog» to true forces the dialog to display, using «filename» as the initial default.
title
The text to use as caption of file selector dialog.
Examples
Fast Writing and reading of Analytica arrays
WriteBinaryFile() and ReadBinaryFile() are the fastest ways to write data from a model to a file or read from a file, because the numeric data does not need to be converted to text when writing or parsed when reading.
Assume: a
is an array indexed by indexes I, J, K
, and l
. Assume also that it contains only numbers, dates and null values -- no text, handles, references, etc.
Write the array to the file:
WriteBinaryFile("a.dat", a, I,J,K,L, bytesPer:12, typeFlags:3 )
Read the array back in:
ReadBinaryFile("a.dat", bytesPer:12, typeFlags:3, resultIndex: I, J, K, L )
Reading a complex binary structure
Binary Shape files (*.shp) and Shape index files (*.shx) have the following header layout for their first 100 bytes:
To read this, create an index for the items of the header:
- Index Header_item :=
['File Code','Unused','Unused','Unused','Unused','Unused','File length','Version','Shape type','Xmin','Ymin','Xmax','Ymax','Zmin','Zmax','Mmin','Mmax']
The bytes per item varies:
- Variable BytesPerItem :=
Table(Header_item)(4,4,4,4,4,4,4,4,4,8,8,8,8,8,8,8,8)
A big-endian (positive) integer is «typeFlag» 32, a little-endian integer is 0, and a little-endian double is 2. So the item types are:
- Variable ItemType :=
Table(Header_item)(32,32,32,32,32,32,32,0,0,2,2,2,2,2,2,2,2)
The full header is then read using:
ReadBinaryFile("filename.shx", bytesPer: BytesPerItem, typeFlags:itemType, resultIndex: Header_item )
The following model embellishes this with a user-interface for selecting the items in a general binary data record and using this to read the structure. For complex binary structures, this may simplify the coding.
- Model file: Reading binary data file.ana
- Try the model with this binary data file: File:NHDWaterbody.shx
From ACP
By specifying a blank filename, or a filename that you know does not exist on the server, ACP will display a file selector dialog on the user's browser, asking her to select a file for upload.
This can be utilized to upload an arbitrary file type to disk, since you can use this to upload the file into memory (by setting «typeFlags» to 7), and then using WriteBinaryFile to save it to the server's disk (again, setting «typeFlags» to 7). This trick can be utilized to upload a spreadsheet when using LibXL for reading spreadsheet data where Excel is not installed on the server, as is the case on Lumina's public ACP server. For security reasons, Excel is not installed on Lumina's public ACP server.
From ADE
When evaluated in the Analytica Decision Engine (ADE) and a dialog needs to be shown to the end-user, it calls IAdeUICallbacks::GetFilename(...). From within that callback, the parent application can interact with the end-user to resolve the file path, and a web applications can instruct the end-user to upload a file. Once complete, the callback returns the full path to the file which is then read. To receive this callback, the parent application must have previously registered the callback with ADE using CAEngine::SetCallbackObject( ).
Once the file read completes, it calls IAdeUICallbacks::FileOpenCompleted().
See Also
- In-memory binary data terms
- Binary and hexadecimal integer formats
- ReadTextFile
- WriteBinaryFile
- GetFromBinaryData
- Dec2Bin
- Hex2Bin
- Bin2Dec
- Bin2Hex
- BitAnd
- BitOr
- BitXOr
- BitNot
- BitShift -- these can be useful when you need to manipulate the bytes of multi-byte integers that you read.
Enable comment auto-refresher