Metadata Changelog

Version history of the Valossa Core metadata JSON format. Version numbers follow the x.y.z scheme. A change to z alone (a patch) is purely additive and will not break existing parsers.
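
The versioning rule above can be turned into a small compatibility check. The following is a minimal sketch, not part of any Valossa library: the function names `parse_version` and `is_patch_level_change` are hypothetical, and the version string is assumed to be already extracted from the metadata. Components are compared as integers, because a plain string comparison would order 1.3.12 before 1.3.9.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, int, int]:
    """Split an 'x.y.z' string into a tuple of integers (hypothetical helper)."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def is_patch_level_change(parser_version: str, metadata_version: str) -> bool:
    """Return True when the two versions share x and y and differ at most in z.

    Per the changelog's rule, such a difference is purely additive, so a
    parser written for one should not break on the other.
    """
    return parse_version(parser_version)[:2] == parse_version(metadata_version)[:2]


if __name__ == "__main__":
    # Same x.y, different z: additive change only.
    print(is_patch_level_change("1.8.0", "1.8.1"))
    # Different y: the changelog should be reviewed before parsing.
    print(is_patch_level_change("1.7.2", "1.8.0"))
```

A parser could run this check once per metadata file and log a warning, rather than fail, when x or y differs from the version it was written against.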

Version History

| Version | Changes |
| --- | --- |
| 1.8.1 | Added audio.speech_detailed.stats detection type. Language code added to audio.speech_detailed detections where language information is available. |
| 1.8.0 | Removed lang attribute from visual.text_region.* detection types. |
| 1.7.2 | Added audio.speech_summary.keyword and topic.genre detection types. |
| 1.7.1 | Added audio.speech_summary detection type. |
| 1.7.0 | Modified topic.iab.section to support Advertisability Score (Ad Score). Changed source attribute to sources attribute supporting multiple source modalities. |
| 1.6.0 | Added under_18_years attribute to human.face detections. Added refined_from_multiple_detection_types:people_under_18_years to by_detection_property. Removed age_years from human.face detections. |
| 1.5.3 | Added quality attribute to human.face detections. |
| 1.5.2 | Added new Valossa category tags for topic.iab and topic.iab.section. Updated IAB taxonomy version to 2.2. |
| 1.5.1 | Added SCD (Special Category Data) support to IAB detections with IAB taxonomy version 2.1. |
| 1.5.0 | Voice emotion moved from seconds-based audio.speech data into the new dedicated audio.voice_emotion detection type. Previous format deprecated. |
| 1.4.3 | Added topic.iab.section detection type. Added voice emotion to seconds-based audio.speech data. Added additional identity information to human.face. |
| 1.4.2 | Added audio.speech_detailed detection type. |
| 1.4.1 | Added OCR detection types: visual.text_region.full_frame_analysis, visual.text_region.lower_third, visual.text_region.middle_third, visual.text_region.upper_third. |
| 1.4.0 | Some tag category identifiers were changed. |
| 1.3.12 | Added face height per second to by_second data for human.face detections. |
| 1.3.11 | Added by_frequency grouping, topic.iab and topic.general detection types. |
| 1.3.10 | Added shs (shot start) to occurrences. |
| 1.3.9 | Added n_audio_channels to technical media information. |
| 1.3.8 | Added visual.object.localized detection type. |
| 1.3.7 | Changed representation of bitrate (bps) in technical media information. |
| 1.3.6 | Added resolution, codecs, and bitrates to technical media information. |
| 1.3.5 | Added visual.color detection type. Added violence-related concept categories. |
| 1.3.4 | Added by_detection_property grouping. Added identifier information for gallery faces. |
| 1.3.3 | Added categ field to relevant visual.context and audio.context detections. |
| 1.3.2 | Added role name support to similar_to in human.face detections. |
| 1.3.1 | Added metadata_type field to distinguish between metadata types. |
| 1.3.0 | Improved speech-to-text format. |
| 1.2.1 | Added speech-to-text support. |
| 1.2.0 | Improved field naming. |
| 1.1.0 | More compact format. |
| 1.0.0 | Major release. Deprecated version 0.6.1. |
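
The version history can also be used to decide which detection types a metadata file of a given version may contain. The sketch below is illustrative only: the `INTRODUCED_IN` mapping and the `available_types` function are hypothetical names, the mapping copies a subset of the "Added ... detection type" entries listed above, and it assumes each "Added" entry marks the first version in which that type can appear. Whether a type actually occurs in a given file still depends on the analyzed content.

```python
from typing import Dict, List, Tuple

# Subset of detection types and the version that added them, copied from the
# version history. The dictionary itself is an illustrative assumption.
INTRODUCED_IN: Dict[str, str] = {
    "audio.speech_detailed.stats": "1.8.1",
    "audio.speech_summary.keyword": "1.7.2",
    "topic.genre": "1.7.2",
    "audio.speech_summary": "1.7.1",
    "audio.voice_emotion": "1.5.0",
    "topic.iab.section": "1.4.3",
    "audio.speech_detailed": "1.4.2",
    "topic.iab": "1.3.11",
    "topic.general": "1.3.11",
    "visual.object.localized": "1.3.8",
    "visual.color": "1.3.5",
}


def version_key(version: str) -> Tuple[int, ...]:
    """Convert 'x.y.z' to integers so that 1.3.12 sorts after 1.3.9."""
    return tuple(int(part) for part in version.split("."))


def available_types(metadata_version: str) -> List[str]:
    """List the mapped detection types added at or before the given version."""
    target = version_key(metadata_version)
    return sorted(
        name
        for name, introduced in INTRODUCED_IN.items()
        if version_key(introduced) <= target
    )


if __name__ == "__main__":
    for name in available_types("1.4.2"):
        print(name)
```

A consumer that must handle older metadata files could use such a lookup to skip processing steps for detection types that cannot be present.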