Metadata Changelog
Version history of the Valossa Core metadata JSON format. The version number follows x.y.z where changes to only z (patch) are purely additive and will not break existing parsers.
Version History
| Version | Changes |
|---|---|
| 1.8.1 | Added audio.speech_detailed.stats detection type. Language code added to audio.speech_detailed detections where language information is available. |
| 1.8.0 | Removed lang attribute from visual.text_region.* detection types. |
| 1.7.2 | Added audio.speech_summary.keyword and topic.genre detection types. |
| 1.7.1 | Added audio.speech_summary detection type. |
| 1.7.0 | Modified topic.iab.section to support Advertisability Score (Ad Score). Changed source attribute to sources attribute supporting multiple source modalities. |
| 1.6.0 | Added under_18_years attribute to human.face detections. Added refined_from_multiple_detection_types:people_under_18_years to by_detection_property. Removed age_years from human.face detections. |
| 1.5.3 | Added quality attribute to human.face detections. |
| 1.5.2 | Added new Valossa category tags for topic.iab and topic.iab.section. Updated IAB taxonomy version to 2.2. |
| 1.5.1 | Added SCD (Special Category Data) support to IAB detections with IAB taxonomy version 2.1. |
| 1.5.0 | Voice emotion moved from seconds-based audio.speech data into the new dedicated audio.voice_emotion detection type. Previous format deprecated. |
| 1.4.3 | Added topic.iab.section detection type. Added voice emotion to seconds-based audio.speech data. Added additional identity information to human.face. |
| 1.4.2 | Added audio.speech_detailed detection type. |
| 1.4.1 | Added OCR detection types: visual.text_region.full_frame_analysis, visual.text_region.lower_third, visual.text_region.middle_third, visual.text_region.upper_third. |
| 1.4.0 | Some tag category identifiers were changed. |
| 1.3.12 | Added face height per second to by_second data for human.face detections. |
| 1.3.11 | Added by_frequency grouping, topic.iab and topic.general detection types. |
| 1.3.10 | Added shs (shot start) to occurrences. |
| 1.3.9 | Added n_audio_channels to technical media information. |
| 1.3.8 | Added visual.object.localized detection type. |
| 1.3.7 | Changed representation of bitrate (bps) in technical media information. |
| 1.3.6 | Added resolution, codecs, and bitrates to technical media information. |
| 1.3.5 | Added visual.color detection type. Added violence-related concept categories. |
| 1.3.4 | Added by_detection_property grouping. Added identifier information for gallery faces. |
| 1.3.3 | Added categ field to relevant visual.context and audio.context detections. |
| 1.3.2 | Added role name support to similar_to in human.face detections. |
| 1.3.1 | Added metadata_type field to distinguish between metadata types. |
| 1.3.0 | Improved speech-to-text format. |
| 1.2.1 | Added speech-to-text support. |
| 1.2.0 | Improved field naming. |
| 1.1.0 | More compact format. |
| 1.0.0 | Major release. Deprecated version 0.6.1. |