Metadata Changelog

Version history of the Valossa Core metadata JSON format. The version number follows x.y.z where changes to only z (patch) are purely additive and will not break existing parsers.

Version History

Version	Changes
1.8.1	Added `audio.speech_detailed.stats` detection type. Language code added to `audio.speech_detailed` detections where language information is available.
1.8.0	Removed `lang` attribute from `visual.text_region.*` detection types.
1.7.2	Added `audio.speech_summary.keyword` and `topic.genre` detection types.
1.7.1	Added `audio.speech_summary` detection type.
1.7.0	Modified `topic.iab.section` to support Advertisability Score (Ad Score). Changed `source` attribute to `sources` attribute supporting multiple source modalities.
1.6.0	Added `under_18_years` attribute to `human.face` detections. Added `refined_from_multiple_detection_types:people_under_18_years` to `by_detection_property`. Removed `age_years` from `human.face` detections.
1.5.3	Added `quality` attribute to `human.face` detections.
1.5.2	Added new Valossa category tags for `topic.iab` and `topic.iab.section`. Updated IAB taxonomy version to 2.2.
1.5.1	Added SCD (Special Category Data) support to IAB detections with IAB taxonomy version 2.1.
1.5.0	Voice emotion moved from seconds-based `audio.speech` data into the new dedicated `audio.voice_emotion` detection type. Previous format deprecated.
1.4.3	Added `topic.iab.section` detection type. Added voice emotion to seconds-based `audio.speech` data. Added additional identity information to `human.face`.
1.4.2	Added `audio.speech_detailed` detection type.
1.4.1	Added OCR detection types: `visual.text_region.full_frame_analysis`, `visual.text_region.lower_third`, `visual.text_region.middle_third`, `visual.text_region.upper_third`.
1.4.0	Some tag category identifiers were changed.
1.3.12	Added face height per second to `by_second` data for `human.face` detections.
1.3.11	Added `by_frequency` grouping, `topic.iab` and `topic.general` detection types.
1.3.10	Added `shs` (shot start) to occurrences.
1.3.9	Added `n_audio_channels` to technical media information.
1.3.8	Added `visual.object.localized` detection type.
1.3.7	Changed representation of bitrate (bps) in technical media information.
1.3.6	Added resolution, codecs, and bitrates to technical media information.
1.3.5	Added `visual.color` detection type. Added violence-related concept categories.
1.3.4	Added `by_detection_property` grouping. Added identifier information for gallery faces.
1.3.3	Added `categ` field to relevant `visual.context` and `audio.context` detections.
1.3.2	Added role name support to `similar_to` in `human.face` detections.
1.3.1	Added `metadata_type` field to distinguish between metadata types.
1.3.0	Improved speech-to-text format.
1.2.1	Added speech-to-text support.
1.2.0	Improved field naming.
1.1.0	More compact format.
1.0.0	Major release. Deprecated version 0.6.1.

Version History​

Version History