| | |
| | | `text`: the text output of speech recognition |
| | | `is_final`: indicating the end of recognition |
| | | `timestamp`:If AM is a timestamp model, it will return this field, indicating the timestamp, in the format of "[[100,200], [200,500]]" |
| | | `stamp_sents`:If AM is a timestamp model, it will return this field, indicating the stamp_sents, in the format of "[{'text':'正 是 因 为','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | `stamp_sents`:If AM is a timestamp model, it will return this field, indicating the stamp_sents, in the format of "[{'text_seg':'正 是 因 为','punc':',','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | ``` |
| | | |
| | | ## Real-time Speech Recognition |
| | |
| | | `text`: the text output of speech recognition |
| | | `is_final`: indicating the end of recognition |
| | | `timestamp`:If AM is a timestamp model, it will return this field, indicating the timestamp, in the format of "[[100,200], [200,500]]" |
| | | `stamp_sents`:If AM is a timestamp model, it will return this field, indicating the stamp_sents, in the format of "[{'text':'正 是 因 为','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | `stamp_sents`:If AM is a timestamp model, it will return this field, indicating the stamp_sents, in the format of "[{'text_seg':'正 是 因 为','punc':',','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | ``` |
| | |
| | | `text`:表示语音识别输出文本 |
| | | `is_final`:表示识别结束 |
| | | `timestamp`:如果AM为时间戳模型,会返回此字段,表示时间戳,格式为 "[[100,200], [200,500]]"(ms) |
| | | `stamp_sents`:如果AM为时间戳模型,会返回此字段,表示句子级别时间戳,格式为 "[{'text':'正 是 因 为','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | `stamp_sents`:如果AM为时间戳模型,会返回此字段,表示句子级别时间戳,格式为 "[{'text_seg':'正 是 因 为','punc':',','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | ``` |
| | | |
| | | ## 实时语音识别 |
| | |
| | | `text`:表示语音识别输出文本 |
| | | `is_final`:表示识别结束 |
| | | `timestamp`:如果AM为时间戳模型,会返回此字段,表示时间戳,格式为 "[[100,200], [200,500]]"(ms) |
| | | `stamp_sents`:如果AM为时间戳模型,会返回此字段,表示句子级别时间戳,格式为 "[{'text':'正 是 因 为','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | `stamp_sents`:如果AM为时间戳模型,会返回此字段,表示句子级别时间戳,格式为 "[{'text_seg':'正 是 因 为','punc':',','start':'430','end':'1130','ts_list':[[430,670],[670,810],[810,1030],[1030,1130]]}]" |
| | | ``` |
| | |
| | | } |
| | | } |
| | | // format |
| | | ts_sent += "{'text':'" + text_seg + "',"; |
| | | ts_sent += "{'text_seg':'" + text_seg + "',"; |
| | | ts_sent += "'punc':'" + characters[idx_str] + "',"; |
| | | ts_sent += "'start':'" + to_string(start) + "',"; |
| | | ts_sent += "'end':'" + to_string(end) + "',"; |
| | | ts_sent += "'ts_list':" + VectorToString(ts_seg) + "}"; |
| | |
| | | end = ts_seg[ts_seg.size()-1][1]; |
| | | } |
| | | // format |
| | | ts_sent += "{'text':'" + text_seg + "',"; |
| | | ts_sent += "{'text_seg':'" + text_seg + "',"; |
| | | ts_sent += "'punc':'',"; |
| | | ts_sent += "'start':'" + to_string(start) + "',"; |
| | | ts_sent += "'end':'" + to_string(end) + "',"; |
| | | ts_sent += "'ts_list':" + VectorToString(ts_seg) + "}"; |