GET /job_results

Retrieves the metadata results of a finished video analysis job.

Endpoint

GET https://api-eu.valossa.com/core/1.0/job_results

Query Parameters

Parameter | Type   | Required | Description
api_key   | string | Yes      | Your Valossa API key
job_id    | string | Yes      | The UUID of the finished job
type      | string | No       | Metadata type to download. Default: core
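
As a minimal sketch of how the three query parameters combine into a request URL, the helper below uses only the Python standard library. The function name build_job_results_url and the result_type argument name are illustrative assumptions, not part of the Valossa API; only the endpoint and the parameter names come from the table above.

```python
from typing import Optional
from urllib.parse import urlencode

# Endpoint as documented above.
JOB_RESULTS_URL = "https://api-eu.valossa.com/core/1.0/job_results"


def build_job_results_url(api_key: str, job_id: str, result_type: Optional[str] = None) -> str:
    """Return the full GET URL for a job_results request (hypothetical helper)."""
    params = {"api_key": api_key, "job_id": job_id}
    # "type" is optional; leaving it out lets the API fall back to its default, core.
    if result_type is not None:
        params["type"] = result_type
    return JOB_RESULTS_URL + "?" + urlencode(params)


print(build_job_results_url("YOUR_API_KEY", "167d6a67-fb99-438c-a44c-c22c98229b93", "frames_faces"))
```

Omitting the type parameter, rather than sending type=core explicitly, keeps the request identical to the shortest form shown in the examples further down.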

Supported Metadata Types

Type Value              | Description
core                    | Valossa Core metadata (default). Contains all detections, groupings, and segmentations.
frames_faces            | Per-frame face bounding box coordinates. See Faces & Identity.
seconds_objects         | Per-second object bounding box coordinates. See Localized Objects.
frames_objects          | Per-frame object bounding box coordinates. See Localized Objects.
speech_to_text_srt      | Speech-to-text results in SRT subtitle format. See Speech & Transcription.
speech_to_text_vtt      | Speech-to-text results in WebVTT subtitle format.
speech_to_text_ttml     | Speech-to-text results in TTML subtitle format.
speech_to_text_avid_txt | Speech-to-text results in Avid text format.
visual_captions         | Visual scene description metadata: natural language descriptions of what is visually happening in each scene. See Scene Descriptions Guide.

Response

  • For core, frames_faces, seconds_objects, frames_objects, and visual_captions: returns HTTP 200 with JSON content.
  • For subtitle output types (speech_to_text_srt, speech_to_text_vtt, speech_to_text_ttml, speech_to_text_avid_txt): returns HTTP 200 with text content.
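
Because the body format depends on the requested type, client code usually needs to decide between JSON parsing and plain-text handling. The sketch below shows one way to do that; decode_result and the two set names are hypothetical, while the type values are the ones listed above.

```python
import json

# Type values grouped exactly as in the Response section above.
JSON_TYPES = {"core", "frames_faces", "seconds_objects", "frames_objects", "visual_captions"}
TEXT_TYPES = {
    "speech_to_text_srt",
    "speech_to_text_vtt",
    "speech_to_text_ttml",
    "speech_to_text_avid_txt",
}


def decode_result(result_type: str, body: str):
    """Parse the body as JSON or return it unchanged as text (hypothetical helper)."""
    if result_type in JSON_TYPES:
        return json.loads(body)
    if result_type in TEXT_TYPES:
        return body
    # Anything else is not documented on this page, so fail loudly.
    raise ValueError(f"unsupported metadata type: {result_type!r}")
```

Keying the decision on the requested type, rather than on response headers, is a design choice made here because this page does not document the Content-Type values the API returns.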

visual_captions JSON Shape

visual_captions returns scene-description sections in a different structure than Core metadata:

{
  "version_info": {
    "metadata_type": "visual_captions",
    "metadata_format_version": "1.2.0"
  },
  "selected_sections": [
    {
      "caption": "A close-up view of a car's side, focusing on the area near the rear window and the roof.",
      "section": {
        "s_start": 6.88,
        "s_end": 9.96,
        "shot_index": 3
      }
    }
  ]
}

Notes:

  • selected_sections[].caption is the natural-language scene description text.
  • selected_sections[].section.s_start and selected_sections[].section.s_end define the time range.
  • selected_sections[].section.shot_index defines the index of the corresponding shot. Shot numbering starts from 0.
  • Unlike Core metadata, this payload uses s_start / s_end instead of ss / se.
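
The field layout described in these notes can be walked as follows. This is a minimal sketch: the payload is the sample shown above, and caption_lines is a hypothetical helper name.

```python
# Sample payload copied from the visual_captions example above.
payload = {
    "version_info": {
        "metadata_type": "visual_captions",
        "metadata_format_version": "1.2.0",
    },
    "selected_sections": [
        {
            "caption": "A close-up view of a car's side, focusing on the area near the rear window and the roof.",
            "section": {"s_start": 6.88, "s_end": 9.96, "shot_index": 3},
        }
    ],
}


def caption_lines(doc: dict) -> list:
    """Format each scene description as '[start - end] shot N: caption' (hypothetical helper)."""
    lines = []
    for item in doc["selected_sections"]:
        sec = item["section"]
        # Note s_start / s_end here, not the ss / se keys used in Core metadata.
        lines.append(
            f"[{sec['s_start']:.2f}s - {sec['s_end']:.2f}s] "
            f"shot {sec['shot_index']}: {item['caption']}"
        )
    return lines


for line in caption_lines(payload):
    print(line)
```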

frames_objects and seconds_objects

Both object-coordinate result types return JSON payloads whose entries link back to the IDs of visual.object.localized detections in Core metadata.

  • seconds_objects is indexed by second.
  • frames_objects is the frame-level counterpart for the same kind of localized object data.
  • See Localized Objects for the coordinate model and linking pattern.

Subtitle/Text Result Types

These type values return text rather than JSON:

Type Value              | Output
speech_to_text_srt      | Standard SRT subtitle text
speech_to_text_vtt      | Standard WebVTT subtitle text
speech_to_text_ttml     | TTML XML subtitle text
speech_to_text_avid_txt | Avid-compatible subtitle text
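
When saving these text results to disk, a small lookup can pick an output file name per type. The file extensions below are assumptions based on common conventions for each format (the API documentation names the formats, not file suffixes), and subtitle_path is a hypothetical helper.

```python
from pathlib import Path

# Assumed extensions: conventional suffixes for each format, not values defined by the API.
SUBTITLE_EXTENSIONS = {
    "speech_to_text_srt": ".srt",
    "speech_to_text_vtt": ".vtt",
    "speech_to_text_ttml": ".ttml",
    "speech_to_text_avid_txt": ".txt",
}


def subtitle_path(job_id: str, result_type: str, directory: str = ".") -> Path:
    """Build an output path such as <directory>/<job_id>.srt (hypothetical helper)."""
    # A KeyError here signals that result_type is not one of the text result types.
    return Path(directory) / f"{job_id}{SUBTITLE_EXTENSIONS[result_type]}"


print(subtitle_path("167d6a67-fb99-438c-a44c-c22c98229b93", "speech_to_text_vtt", "subtitles"))
```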

Examples

Download Core Metadata

curl "https://api-eu.valossa.com/core/1.0/job_results?api_key=YOUR_API_KEY&job_id=167d6a67-fb99-438c-a44c-c22c98229b93" \
-o core_metadata.json

Download Face Bounding Boxes

curl "https://api-eu.valossa.com/core/1.0/job_results?api_key=YOUR_API_KEY&job_id=167d6a67-fb99-438c-a44c-c22c98229b93&type=frames_faces" \
-o faces.json

Download Speech-to-Text SRT

curl "https://api-eu.valossa.com/core/1.0/job_results?api_key=YOUR_API_KEY&job_id=167d6a67-fb99-438c-a44c-c22c98229b93&type=speech_to_text_srt" \
-o subtitles.srt

Python

import requests

# Download core metadata
response = requests.get(
    "https://api-eu.valossa.com/core/1.0/job_results",
    params={
        "api_key": "YOUR_API_KEY",
        "job_id": "167d6a67-fb99-438c-a44c-c22c98229b93",
    },
)
response.raise_for_status()  # fail early on an HTTP error instead of parsing an error body
metadata = response.json()

# Download SRT transcript
srt_response = requests.get(
    "https://api-eu.valossa.com/core/1.0/job_results",
    params={
        "api_key": "YOUR_API_KEY",
        "job_id": "167d6a67-fb99-438c-a44c-c22c98229b93",
        "type": "speech_to_text_srt",
    },
)
srt_response.raise_for_status()
srt_content = srt_response.text

JavaScript

// Download core metadata
const response = await fetch(
  "https://api-eu.valossa.com/core/1.0/job_results?api_key=YOUR_API_KEY&job_id=167d6a67-fb99-438c-a44c-c22c98229b93"
);
const metadata = await response.json();

// Download SRT transcript
const srtResponse = await fetch(
  "https://api-eu.valossa.com/core/1.0/job_results?api_key=YOUR_API_KEY&job_id=167d6a67-fb99-438c-a44c-c22c98229b93&type=speech_to_text_srt"
);
const srtContent = await srtResponse.text();

Example Core Metadata Response (Abbreviated)

{
  "version_info": {
    "metadata_type": "core",
    "metadata_format": "1.8.1",
    "backend": "3.1.1"
  },
  "job_info": {
    "job_id": "167d6a67-fb99-438c-a44c-c22c98229b93",
    "request": { ... }
  },
  "media_info": {
    "technical": { "duration_s": 120.5, "fps": 24 }
  },
  "detections": { ... },
  "detection_groupings": { ... },
  "segmentations": { ... }
}
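
To illustrate reading the top-level fields of this response, the sketch below pulls out a few values from the abbreviated example. Only the fields visible above are used; the elided objects are left out, summarize_core is a hypothetical helper, and the frame-count estimate assumes a constant frame rate.

```python
# Fields taken from the abbreviated example above; the elided objects are omitted.
core = {
    "version_info": {"metadata_type": "core", "metadata_format": "1.8.1", "backend": "3.1.1"},
    "job_info": {"job_id": "167d6a67-fb99-438c-a44c-c22c98229b93"},
    "media_info": {"technical": {"duration_s": 120.5, "fps": 24}},
}


def summarize_core(doc: dict) -> dict:
    """Collect a few headline values from Core metadata (hypothetical helper)."""
    tech = doc["media_info"]["technical"]
    return {
        "job_id": doc["job_info"]["job_id"],
        "metadata_format": doc["version_info"]["metadata_format"],
        "duration_s": tech["duration_s"],
        # Rough estimate only: assumes the frame rate is constant over the whole video.
        "approx_frames": round(tech["duration_s"] * tech["fps"]),
    }


print(summarize_core(core))
```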

Notes

  • Store metadata locally. Metadata may not be stored permanently on Valossa servers, and download count limits may be imposed in the future.
  • The API returns JSON without line breaks or indentation. To pretty-print it for human reading, use a tool such as jq (jq . metadata.json) or Python's built-in json.tool module: python -m json.tool metadata.json > pretty.json
  • SRT files use Unix newlines (LF only, not CRLF).
  • Not all metadata types may be available for every job. Availability depends on your subscription's AI features and when the job was analyzed.
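
The storage and formatting notes above can be combined into a small local-archive step. This is a sketch using only the standard library; pretty_json, save_pretty_json, and uses_lf_only are hypothetical helper names.

```python
import json
from pathlib import Path


def pretty_json(compact_json: str) -> str:
    """Re-serialize compact JSON text with two-space indentation."""
    return json.dumps(json.loads(compact_json), indent=2, ensure_ascii=False)


def save_pretty_json(compact_json: str, path: Path) -> None:
    """Store a human-readable local copy of downloaded metadata."""
    path.write_text(pretty_json(compact_json) + "\n", encoding="utf-8")


def uses_lf_only(text: str) -> bool:
    """Return True if the text has no carriage returns, i.e. Unix (LF) newlines only."""
    return "\r" not in text


print(pretty_json('{"version_info":{"metadata_type":"core"}}'))
```

A check like uses_lf_only can be useful before handing SRT files to tools that expect CRLF line endings, since such tools may need a conversion step first.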