Audio JSON Format
- 28 Feb 2024
- Print
- DarkLight
- PDF
Audio JSON Format
- Updated On 28 Feb 2024
- Print
- DarkLight
- PDF
Article summary
Did you find this summary helpful?
Thank you for your feedback
Overview
Dataloop's Audio transcription studio allows creating and editing audio transcription. The annotations JSON format is similar to Video annotation, where annotations also span across time, but simpler, since there are no annotation coordinates.
Format Details
{
"annotations": [
{
"id": "632894ed22e1200334d39638",
"datasetId": "6281eab332cd64b1c004319a",
"itemId": "6282247746a76b5e12ddb4fe",
"url": "https://gate.dataloop.ai/api/v1/annotations/632894",
"item": "https://gate.dataloop.ai/api/v1/items/6282247746a76e",
"dataset": "https://gate.dataloop.ai/api/v1/datasets/6281eab332cd19a",
"type": "subtitle",
"label": "Relaxed",
"attributes": [
],
"metadata": {
"system": {
"attributes": {
"1": 0.7,
"2": "Flase"
},
"openAnnotationVersion": "1.48.2-rc.67",
"recipeId": "6281eab46551fa03a3fa9ae8"
},
"user": {
"karaokeData": [
{
"confidence": 0.38077378,
"endTime": 2.88,
"startTime": 0.02,
"text": "I"
},
{
"confidence": 0.47150243,
"endTime": 4.65,
"startTime": 2.88,
"text": "Am"
},
{
"confidence": 0.96968557,
"endTime": 6.997,
"startTime": 4.86,
"text": "the"
},
{
"confidence": 0.39305196,
"endTime": 8.2,
"startTime": 6.998,
"text": "annotation"
},
{
"confidence": 0.70361846,
"endTime": 10.12,
"startTime": 8.251,
"text": "Transcription"
}
]
}
},
"creator": "shlomi@dataloop.ai",
"createdAt": "2022-09-19T16:12:29.879Z",
"updatedBy": "shlomi@dataloop.ai",
"updatedAt": "2022-09-19T16:12:29.879Z",
"hash": "73cfecbe0e9cf8e562ec18b3c058d40ddc400372",
"source": "ui",
"coordinates": {
"text": "I am the annotation Transcription"
}
Dictionary Table
Key Name | Definition | Parent Key |
---|---|---|
annotations | List of annotations | N/A |
id | Annotations ID | annotations |
datasetId | Dataset ID | annotations |
type | Annotation type - 'subtitle' for audio transcription | annotations |
label | The annotation's label/class | annotations |
metadata | This key holds all of the annotation information | annotations |
system | This key holds all of the annotation system information | metadata |
isOnlyLocal | A field used in the UI to determine if the annotation is ready to be saved or not (False – ready to be saved) | system |
Attributes | system | |
system | True - the system created this specific annotation False - annotation was created on a different way | system |
itemlinks | system | |
openAnnotationVersion | product version | system |
recipeID | ID of recipe used in this task | system |
user | Metadata that can be added by user via SDK, also used for storing word-level-timing information, which is imported into the platform and therefore considered user information | annotations |
Confidence | Transcription confidence, as generated by source model | system |
End time | Transcription word level end time, in seconds, out of the audio file lenght | system |
Start time | Transcription word level start time, in seconds, out of the audio file lenght | system |
Text | Transcription text included in this timing section | system |
creator | Annotation creator | annotations |
createdAt | Annotation creation date and time | annotations |
updatedBy | Annotation edits by user name | annotations |
updatedAt | Annotation edits date and time | annotations |
hash | Unique hash for this annotation | annotations |
Source | Indicates 'UI' when annotation was manually created | system |
Coordinates | Contains optional word-level timing information for audio transcription | system |