Skip to main content
Version: v24.05

Parse GenDC data

In this tutorial, we learn how to use GenDC separator library. If your device data format is non-GenDC (general camera acquire images), see the next tutorial page Parse non-GenDC binary data.

If your device is not in GenDC format but you would like to learn about, you can Download sample GenDC which has 6 valid components out of 9 components with 4 different sensor data.

Prerequisite

  • GenDC Separator (installed with sensing-dev SDK)
  • GenDC Data (obtained in the previous tutorial or Download sample from this page).

Tutorial

In the previous tutorial, we learned how to save GenDC data into a binary file. Now, we will load the data, parse the container, and retrieve some information about the sensor from its descriptor.

note

We assume that GenDC used in this tutorial contains at lease one image data component. If your saved data in the previous tutorial, please download sample data from this page.

GenDC

GenDC, or Generic Data Container, is defined by the EMVA (European Machine Vision Association). True to its name, it can contain any types of data defined by the camera device, regardless of the data dimension, metadata, or whether they are image sequences/bursts.

While the format rule is defined in the official document, GenDC Separator helps you with easily parsing the whole container.

If you would like to learn the overview of GenDC, please check this page that gives you the concept and rough structure of GenDC.

Find Binary file.

If you use the binary file saved in the previous tutorial, the name of the directory should be tutorial_save_gendc_XXXXXXXXXXXXXX and binary file prefix is gendc0-.

std::string directory_name = "tutorial_save_gendc_XXXXXXXXXXXXXX";
std::string prefix = "gendc0-";
info

The complete code provided at the end of this tutorial gives you the command line option.

If you set -d tutorial_save_gendc_XXXXXXXXXXXXXX or --directory tutorial_save_gendc_XXXXXXXXXXXXXX option when you execute the tutorial program, directory_name will be automatically set.

Also, if you are using sample data output.bin provided at the top of this tutorial page, you can simply add -u or --use-dummy-data option so that the program look for downloaded output.bin.

The following snippet attempts to retrieve all binary files starting with a specified prefix from a directory. It then reorders all the found binaries according to their recorded order.

std::vector<std::string> bin_files;
for (const auto& entry : std::filesystem::directory_iterator(directory_name)) {
if (entry.path().filename().string().find(prefix) == 0 && entry.is_regular_file() && entry.path().extension() == ".bin") {
bin_files.push_back(entry.path().filename().string());
}
}

//re-order binary files to sensor0-0.bin, sensor0-1.bin, sensor0-2.bin...
std::sort(bin_files.begin(), bin_files.end(), [](const std::string& a, const std::string& b) {
return extractNumber(a) < extractNumber(b);
});

Now, we go through all ordered binary files in bin_files with for loop.

for (const auto& filename : bin_files){

}

Open and load Binary file.

In the for loop, our target (single) binary file is filename. We open this binary file is ifstream.

std::filesystem::path jth_bin= std::filesystem::path(directory_name) / std::filesystem::path(filename);
std::ifstream ifs(jth_bin, std::ios::binary);

To obtain the size of whole binary file, you set the ifstream at the end of file so that the distance between the current position and the beginning of the file is equal to the filesize.

ifs.seekg(0, std::ios::end);
std::streampos filesize = ifs.tellg();

Do not forget to put the ifstream back to the beginning of the file to load the content.

ifs.seekg(0, std::ios::beg);
char* filecontent = new char[filesize];

Parse binary file

GenDC Separator has isGenDC to check if the data has GenDC signature. Before parsing the whole data, it is always a good idea to make sure if the data is actually saved as GenDC format.

isGenDC(filecontent);

If it returns true, we can create an GenDC ContainerHeader object from the data. To learn some terminologies of GenDC, such as Container, Component, and Part, please check this page.

ContainerHeader gendc_descriptor = ContainerHeader(filecontent);

Now this object contains all information written in the GenDC Descriptor. You can get the size of Descriptor and the size of data.

int32_t descriptor_size = gendc_descriptor.getDescriptorSize();
int64_t container_data_size = gendc_descriptor.getDataSize();

The whole container size is the summation of this DescriptorSize and DataSize, so if you want to load the next container information, you can just add the total as the offset of the original data.

ContainerHeader next_gendc_descriptor= ContainerHeader(filecontent + descriptor_size + data_size);

In this tutorial, we will detect which Component has image sensor data, get some properties of the sensor data such as 1. how many channels 2. Dimension of data, and 3. Byte-depth of data, and finally display the image with OpenCV.

First of all, we need to find the first available image data Component with getFirstComponentIndexByTypeID(), which returns the index of the first available data Component if its datatype matches the parameter. If it returns -1, it means no valid data is set on the sensor side.

Here are some datatype difined by GenICam.

Datatype keyDatatype ID Value
Undefined0
Intensity1
Infrared2
Ultraviolet3
Range4
......
Metadata0x8001

reference: 4.13ComponentIDValue on GenICam Standard Features Naming Convention

Since we want to get image (i.e. intensity) data, use 1 for Datatype ID Value.

// get first available image component
int32_t image_component_index = gendc_descriptor.getFirstComponentIndexByTypeID(1);

Now, we can access the header information of the Component that contains the image data.

ComponentHeader image_component = gendc_descriptor.getComponentByIndex(image_component_index);

The Component has one or more Parts. We can iterate through them using a for loop. The number of Parts for image Components are usually the number of color channel. e.g. 1 for Monochrome-space image and 3 for general color-pixel-format such as RGB.

int part_count = image_component.getPartCount();
for (int idx = 0; idx < part_count; idx++) {
PartHeader part = image_component.getPartByIndex(idx);
int part_data_size = part.getDataSize();

To copy image data, we need to create a buffer to store the data in each Part.

uint8_t* imagedata;
imagedata = new uint8_t [part_data_size];
part.getData(reinterpret_cast<char *>(imagedata));

Currently, the image data is in a 1D array format imagedata. To display the preview image, we can reshape it by setting the following information:

  • Width
  • Height
  • Color-channel
  • Byte-depth

getDimension() returns a vector containing the width and height.

std::vector <int32_t> image_dimension = part.getDimension();

To determine the byte-depth, you can calculate it from the total size of the data and the obtained dimension values above.

int32_t bd = part_data_size / WxH;

Now, we copy the data from 1D array imagedata to the image-formatized cv::Mat img with memcpy to display:

cv::Mat img(image_dimension[1], image_dimension[0], CV_8UC1);
std::memcpy(img.ptr(), imagedata, datasize);
cv::imshow("First available image component", img);

cv::waitKeyEx(1);

Note that CvMat type system varies depends on the byte-depth and color-channel. Generally the format is CV_<bit-depth><signed/unsigned>C<number of channel>. See the detail in official document.

If you use sample data provided at the top of this page, you may see the image like this:

sample image.

Example with Sample Data

In the previous section, we learned general idea of parsing GenDC data with GenDC Separator API. Now, we can use sample data provided at the top of this page (or here)) to handle some non-image data.

Sample Data structure.

As we have done for image sensor data, we can get 1. how many channels 2. Dimension of data, and 3. Byte-depth of data. We also provide visualize tutorial code for Python version Visualize GenDC data.

Since all non-image data's Datatype is Metadata, we need another information to access each component. So, finding the target Component by TypeId may not be a good idea.

Usually, each Component has uniquie SourceId to detect the data. This time, we use getFirstComponentIndexBySourceId to look for the target Component.

Component IndexSensor typeValiditySourceIdTypeId
0Imagevalid0x10011 (Intensity)
1Audiovalid0x20010x8001 (Metadata)
2Analog 1valid0x30010x8001 (Metadata)
3Analog 2valid0x30020x8001 (Metadata)
4Analog 3valid0x30030x8001 (Metadata)
5PMODvalid0x40010x8001 (Metadata)
6extrainvalid0x00010x8001 (Metadata)
7N/Ainvalid0x50010x8001 (Metadata)
8N/Ainvalid0x60010x8001 (Metadata)

Obtain audio data

Once we know the index of target Component, we can create ComponentHeader object as well as we did in image Component.

int audio_component_index = gendc_descriptor.getFirstComponentIndexBySourceId(0x2001);
ComponentHeader audio_component = gendc_descriptor.getComponentByIndex(audio_component_index);

This Container has 2 Parts and they have Left and Right channel of audio data respectively.

int audio_part_count = audio_component.getPartCount();
info

Some of audio sensor uses interleaved audio, which store Lch and Rch in the same Part in turns. In this case, audio_component.getPartCount() returns 1, but you need to re-shape the data into 2-dim to use it.

As we did for image Component, we check the data size and dimension of each Part.

for (int idx = 0; idx < audio_part_count; idx++) {
PartHeader part = audio_component.getPartByIndex(idx);
int part_data_size = part.getDataSize();
std::vector <int32_t> audio_dimension = part.getDimension();

Here, audio_dimension is {800}. Typically, the image Component of GenDC stores one frame of data, while other sensor data store samples obtained during the time it takes to acquire one frame of that image data. For example, if the images are being acquired at 60fps, only the samples obtained within 1/60 of a second are stored. This audio data is sampled at 48kHz, and since 800 samples are acquired within 1/60 of a second, the dimension becomes 800.

To know the data-type of this audio data, you need the byte-depth, which can be calculated with the size of data and dimension.

int32_t bd = part_data_size / 800;

Since part_data_size is 1600, we now know the byte-depth is 2, so the data-type is int16_t.

If you visualize this data with Python tutorial code, you may see the following plot:

sample image.

Obtain analog data

This GenDC data has 3 analog sensors which has identifier 0x3001, 0x3002, and 0x3003. We can find the index of Component with getFirstComponentIndexBySourceId() as well.

int analog_component_index = gendc_descriptor.getFirstComponentIndexBySourceId(0x3001);
ComponentHeader analog_component = gendc_descriptor.getComponentByIndex(analog_component_index);

Each analog sensor's * has a single Part*, which can be confirmed with getPartCount().

int analog_part_count = analog_component.getPartCount();

You can get Part to know data size, dimension (number of samples), and byte-depth as well.

for (int idx = 0; idx < analog_part_count; idx++) {
PartHeader part = analog_component.getPartByIndex(idx);
int part_data_size = part.getDataSize();
std::vector <int32_t> analog_dimension = part.getDimension();
}

If you visualize this data with Python tutorial code, you may see the following plot:

sample image.

Obtain PMOD data

This GenDC data has 1 PMOD accelerometer sensor, which is connected to accelerometer recording the x, y, and z coordinate information. Therefore, the number of Parts is 3.

int pmod_component_index = gendc_descriptor.getFirstComponentIndexBySourceId(0x4001);
ComponentHeader pmod_component = gendc_descriptor.getComponentByIndex(pmod_component_index);
int pmod_part_count = pmod_component.getPartCount();
info

Some of accelerometer sensor uses interleaved structure, which store X, Y, and Z coordinate in the same Part in turns. In this case, pmod_component.getPartCount() returns 1, but you need to re-shape the data into 3-dim (or 4-dim with 1 dummy channel) to use it.

You can get Part to know data size, dimension (number of samples), and byte-depth as well.

for (int idx = 0; idx < pmod_part_count; idx++) {
PartHeader part = pmod_component.getPartByIndex(idx);
int part_data_size = part.getDataSize();
std::vector <int32_t> pmod_dimension = part.getDimension();
...

Since each Part have x, y, and z data respectively, with Python tutorial code, you may see the following plot:

sample image.

Use Typespecific information

If you want to access some device-specific data stored in TypeSpecific field of GenDC Part.

For example, the sample GenDC data has framecount data at the lower 4 bytes of the 8-byte TypeSpecific3, which represents the device-generated unique id for each frame.

Note that TypeSpecific starts from N = 1, 2, 3... and index is 0, 1, 2... so the index of TypeSpecific3 is 2.

int64_t typespecific3 = part.getTypeSpecificByIndex(2);
int32_t framecount = static_cast<int32_t>(typespecific3 & 0xFFFFFFFF);
std::cout << "Framecount: " << framecount<< std::endl;

Complete code

Complete code used in the tutorial is here.