Method for quickly reviewing call content

文档序号:1819977 发布日期:2021-11-09 浏览:19次 中文

阅读说明:本技术 一种快速回顾通话内容的方法 (Method for quickly reviewing call content ) 是由 徐时豪 于 2021-08-12 设计创作,主要内容包括:本发明公开了一种快速回顾通话内容的方法,步骤一、在通信终端上开启以文本形式记录通信内容的权限;步骤二、当通话已经建立并且授权以文本形式记录通信内容后,终端设备的语音收发模块提取收发多方的音频流发送至语音转文字功能模块;步骤三、语音转文字功能模块对音频流进行文字转换并生成文本,语音转文字功能模块将文本结果发送终端通话模块;步骤四、终端的通话模块在收到语音转文本的文本结果后,根据用户的需求实时显示在界面上。本发明将通话的内容实时转为文字,既可以供用户实时地查阅,也可以保存在终端或服务器上,以供后续使用。(The invention discloses a method for quickly reviewing call content, which comprises the steps of firstly, opening the authority for recording the communication content in a text form on a communication terminal; step two, after the call is established and the recording of the communication content in the text form is authorized, the voice transceiving module of the terminal equipment extracts audio streams of transceiving parties and sends the audio streams to the voice-to-text function module; thirdly, the voice-to-character functional module performs character conversion on the audio stream to generate a text, and the voice-to-character functional module sends a text result to the terminal communication module; and step four, after receiving the text result of the voice-to-text conversion, a call module of the terminal displays the text result on an interface in real time according to the requirements of the user. The invention converts the conversation content into characters in real time, and the characters can be consulted by a user in real time and also can be stored on a terminal or a server for subsequent use.)

1. A method for quickly reviewing call content, comprising:

step one, opening the authority for recording the communication content in a text form on a communication terminal;

step two, after the call is established and the recording of the communication content in the text form is authorized, the voice transceiving module of the terminal equipment extracts audio streams of transceiving parties and sends the audio streams to the voice-to-text function module;

thirdly, the voice-to-character functional module performs character conversion on the audio stream to generate a text, and the voice-to-character functional module sends a text result to the terminal communication module;

and step four, after receiving the text result of the voice-to-text conversion, a call module of the terminal displays the text result on an interface in real time according to the requirements of the user.

2. The method of claim 1, wherein the first step occurs before the call begins.

3. The method of claim 1, wherein the first step occurs after the call is connected.

4. The method according to claim 1, wherein the voice-to-text function module in step two is a local function, and the voice stream is directly sent to the local voice-to-text function module.

5. The method according to claim 1, wherein the voice-to-text function module in step two is a remote service, and a connection to the server needs to be initiated when the voice-to-text function module is started, and then the voice stream is sent to the voice-to-text function module of the server.

6. The method of claim 1, wherein the text results are stored in the local or remote server periodically after the terminal receives the text results in step four.

7. The method as claimed in claim 1, wherein the terminal in step four stores the received text results after the call is over.

8. The method as claimed in claim 1, wherein the terminal in step four receives the text result, calls other application software, and inputs the text to the called application software for further sub-processing.

9. The method of claim 1, wherein the step four terminal is configured with a keyword list.

10. The method of claim 9, wherein the terminal performs keyword matching according to the received text result.

Technical Field

The invention relates to the field of communication, in particular to a method for reviewing call content.

Background

Currently, for call records occurring on a communication device, the main contents stored are: the starting time and the ending time of the call, the number of the opposite end of the call and the user name. For some users with special requirements, real-time call recording can be provided and stored on a terminal or a server for subsequent use.

For users who need to save call content, the current solution has many disadvantages: 1. the user must be able to play back the saved content after the call is over, and the user cannot review the information of the earlier time of the call while the call is still in progress. 2. The audio files can mix and record the sounds of both parties of the call, which is inconvenient to distinguish. It is more difficult to distinguish if it is a multi-party call. 3. The time required to play back an audio file is relatively long and playing at double speed still requires at least half the time of the original call. 4. Audio files are not easily retrievable, and information for keywords of interest to a user is not easily retrieved within one or more audio files. 5. The audio file storage space is large, and a large amount of information of conversation is not easy to store for the communication terminal with limited memory capacity. 6. Audio files are also not conducive to multi-lingual translation.

Disclosure of Invention

In order to solve the problem of the existing call content review, it is necessary to introduce a method for quickly reviewing call content, so that the call content is converted into characters in real time, and the characters can be consulted by a user in real time and also can be stored on a terminal or a server for subsequent use.

In order to achieve the above purpose, the invention provides a method for quickly reviewing call content, which adopts the following technical scheme:

step one, opening the authority for recording the communication content in a text form on a communication terminal;

step two, after the call is established and the recording of the communication content in the text form is authorized, the voice transceiving module of the terminal equipment extracts audio streams of transceiving parties and sends the audio streams to the voice-to-text function module;

thirdly, the voice-to-character functional module performs character conversion on the audio stream to generate a text, and the voice-to-character functional module sends a text result to the terminal communication module;

and step four, after receiving the text result of the voice-to-text conversion, a call module of the terminal displays the text result on an interface in real time according to the requirements of the user.

Further, the first step occurs before the call is started.

Further, the first step occurs after the call is connected.

Further, the voice-to-text function module in the second step is a local function, and then the voice stream is directly sent to the local voice-to-text function module.

Further, the voice-to-text function module in the second step is a remote service, and then it is necessary to initiate connection to the server when starting voice-to-text, and then send a voice stream to the voice-to-text function module of the server.

Further, the terminal in the fourth step stores the text result in a local or remote server in a timed manner after receiving the text result.

Further, the terminal in the fourth step stores the received text results in a unified manner after the call is finished.

Further, after receiving the text result, the terminal in the fourth step calls other application software, and inputs the text into the called application software for next sub-processing.

Further, the terminal in the fourth step is configured with a keyword list.

Further, the terminal matches the keywords according to the received text result.

The method for quickly reviewing the call content provided by the invention has the following beneficial effects:

1. the user can review the information of the earlier time of the call in the process of still carrying out the call, and the user can play back the stored content in real time;

2. and searching for keywords in the audio text file, and searching for information of keywords concerned by the user in one or more audio text files.

3. The text file of the audio is stored to the terminal or the server, so that the storage capacity of the communication terminal or the server is greatly enhanced.

Drawings

Fig. 1 is a logic diagram of a method for quickly reviewing call content according to the present invention.

Detailed Description

The principles and spirit of the present invention will be described below with reference to several exemplary embodiments, which should be understood to be presented only to enable those skilled in the art to better understand and implement the present invention, and not to limit the scope of the present invention in any way. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

As will be appreciated by one skilled in the art, embodiments of the present invention may be embodied as a system, apparatus, device, method, or computer program product. Accordingly, the present disclosure may be embodied in the form of: entirely hardware, entirely software (including firmware, resident software, micro-code, etc.), or a combination of hardware and software.

According to the embodiment of the invention, the method for quickly reviewing the call content converts the call content into the characters in real time, so that the characters can be consulted by a user in real time and can also be stored on a terminal or a server for subsequent use.

For a clearer explanation of the method for quickly reviewing the call content, a specific embodiment is described below, but it should be noted that the embodiment is only for better explaining the present invention and is not to be construed as an undue limitation on the present invention.

As shown in fig. 1, a method for quickly reviewing call contents according to the present invention is implemented by the following steps:

step one, opening the authority for recording the communication content in the form of text on the communication terminal. The action can be performed before the call starts or after the call is connected;

step two, after the call is established and the recording of the communication content in the text form is authorized, the voice receiving and sending module of the terminal equipment extracts the audio streams of the receiving and sending parties and sends the audio streams to the functional module for converting the voice into words;

step three, the functional module of the voice to character can be the installed service of the terminal, and can also be the remote service based on the network; if the local service is available, the voice stream is directly sent to the local service module; if the service is the remote service, the connection to the server needs to be initiated at the moment of starting the voice to text conversion, and then the service requirement is sent;

and step four, after receiving the text result of the voice-to-text conversion, a call module of the terminal displays the text result on an interface in real time according to the requirements of the user, and if the text result has the requirement of real-time sharing, the text result can be stored in a local or remote server at regular time and can also be stored uniformly after the call is finished.

The user can view the content of the current call at an earlier time in real time, such as the information played by the opposite terminal when the call is just started, and the user can view the content directly on the screen without hearing clearly.

The text of the conversation content can provide richer services by matching with other software, and after the terminal obtains the text extracted from the voice, other application software is called, and the text is input into the application software for next sub-processing. For example, in cooperation with translation software, the result of converting speech into text can be sent to the translation software, and the translation result obtained in real time by the translation software is received and displayed on the interface of the terminal. Therefore, the translated content can be prompted to the user in real time.

The matching of the keywords can be carried out in real time, a keyword list can be configured in the terminal, and one or more trigger behavior plans are provided for each keyword; and the terminal software matches the text result with the keyword list in real time, and if a certain keyword is matched, a corresponding action can be triggered. For example, the following keyword list may be preset at a central office of a hotel: reservation, starting a computer reservation program, ordering, transferring a catering department, calling morning, starting a calling morning program and the like. If the user calls the telephone number and mentions that the user wants to reserve the room, the terminal finds the word of reserving in the text obtained by voice recognition, and then the terminal can send information to the computer of the switchboard to start the room reservation program to help the switchboard staff to quickly finish the reservation. Moreover, because the texts are generated according to the voice channels respectively, the next operation can be easily performed by setting who says the keywords. Such as: the user mentions "order" to initiate the order process. For another example, in a customer call center of a telecommunication, a user calls to say that the broadband speed of the user is abnormal, the terminal recognizes the keyword of the broadband speed, immediately sends the keyword of the broadband speed to the consultation software of the computer of the customer service, and automatically calls out a solution about the common problem of the broadband speed from the computer software for the customer service to refer to. The call center may also consult with relevant experts who have customer service pre-configured with keywords like "broadband speed". When the broadband speed appears in the call, the terminal interface can jump out of the prompt box to inquire whether the customer service needs to transfer the call to the expert customer service with the broadband speed for answering. Another application scenario is as follows: when a call center of an insurance company receives a call, the terminal can automatically search a single number in a real-time text of the call, after the single number is searched, the alphanumerics with a specific length appearing after the characters of the single number are extracted and are sent to a computer of a customer service as the insurance policy number, and the customer service can immediately see the relevant information about the insurance policy.

One advantage of this solution is that after the speech is converted into text, it can be shared in multiple software, for example, the hotel desk in the previous example, i.e. the translation function can be started, and also the keyword service can be started at the same time, thus greatly expanding the functions of the terminal. The text extracted in the call can be stored in the local or server together with key contents such as information of a caller, call time and the like, the text file not only occupies small space, but also has high reading speed, is convenient to search in one or more files, and provides great convenience for users. For example, telecommunication service calls are stored in text form and can be easily searched if a worker wants to count how many customers complain of broadband speed over a period of time.

While the spirit and principles of the invention have been described with reference to several particular embodiments, it is to be understood that the invention is not limited to the disclosed embodiments, nor is the division of aspects, which is for convenience only as the features in such aspects may not be combined to benefit. The invention is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.

The limitation of the protection scope of the present invention is understood by those skilled in the art, and various modifications or changes which can be made by those skilled in the art without inventive efforts based on the technical solution of the present invention are still within the protection scope of the present invention.

6页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:用于获取录音系统异常信息的方法及装置、电子设备、存储介质

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!

技术分类