Multi-role intelligent sound box companion system

Document No.: 1186295 Published: 2020-09-22

Reading note: this technology, "Multi-role intelligent sound box companion system" (一种多角色智能音箱伴侣系统), was designed by 闫钊杰 on 2020-05-28. Main content: The invention discloses a multi-role intelligent sound box companion system and relates to the technical field of intelligent sound boxes. The system comprises a background server, an intelligent sound box and terminal equipment; the intelligent sound box is in data connection with a plurality of non-intelligent sound boxes; the intelligent sound box is used for importing a script through a mobile storage device or the terminal equipment; the terminal equipment is used for compiling and storing the script; and the background server provides resources and services for the intelligent sound box and the terminal equipment. One intelligent sound box is connected to multiple ordinary sound boxes, the script is customized on the terminal equipment, and the intelligent sound box generates the play time sequence and role distribution scheme from the script content. Once the sentences or roles that need intelligent conversation are marked in the script, the system automatically generates the dialogue and converts it into voice. Drawing on the network resources of the intelligent sound box greatly widens the sources of scripts, and avoiding the use of multiple intelligent sound boxes reduces cost.

1. A multi-role intelligent sound box companion system, characterized by comprising:

a background server, an intelligent sound box and terminal equipment, which are in communication connection with one another through the Internet;

the intelligent sound box is in data connection with a plurality of non-intelligent sound boxes; the intelligent sound box is used for importing a script through a mobile storage device or the terminal equipment;

the terminal equipment comprises a mobile terminal and a fixed terminal; the terminal equipment is used for compiling and storing the script; through its built-in software, the terminal equipment performs role setting, script compiling, role voice configuration, background music setting and intelligent conversation setting;

and the background server is used for providing resources and services for the intelligent sound box and the terminal equipment.

2. The multi-role intelligent sound box companion system of claim 1, wherein the communication connection between the intelligent sound box and the plurality of non-intelligent sound boxes includes but is not limited to Bluetooth, Wi-Fi and data lines; and the modes by which the intelligent sound box accesses the Internet include but are not limited to wireless network access point networking, wired broadband networking and mobile information system networking.

3. The multi-role intelligent sound box companion system of claim 1, wherein the mobile terminal includes but is not limited to a mobile phone, a tablet computer and a palmtop computer; and the fixed terminal includes but is not limited to a personal computer, a server and a virtual machine.

4. The multi-role intelligent sound box companion system according to claim 1 or 3, wherein the mobile terminal can be connected directly to the intelligent sound box when no Internet access is available, and the connection modes include but are not limited to Bluetooth, USB, infrared and data line.

5. The multi-role intelligent sound box companion system according to claim 1, wherein the process by which the terminal equipment compiles and imports the script is as follows:

step S1: establishing connections between the intelligent sound box and the plurality of non-intelligent sound boxes; if Internet access is available, connecting the intelligent sound box to the Internet; otherwise, connecting the intelligent sound box to the terminal through Bluetooth, USB or another mode;

step S2: starting the companion software on the terminal equipment; the intelligent sound box actively reports its own state information, and all connected intelligent and non-intelligent sound boxes are shown in the companion software;

step S3: selecting a new script in the companion software to enter the script compiling page; on this page, the user can add script roles, compile script dialogue, and select background music and sound effects together with the script passages to which they apply;

step S4: after the script is compiled, the user can choose to store it locally or download it directly to the intelligent sound box; once the choice is made, the software packs the resources required for playing the script and carries out the chosen operation.

6. The multi-role intelligent sound box companion system according to claim 5, wherein in step S3, the sound source corresponding to a script role, and the speaking speed and volume of that role, can be configured manually when the role is added.

7. The multi-role intelligent sound box companion system of claim 5, wherein in step S4, the resources include but are not limited to a user configuration file, a marked script file, and the resource files required for playing the script.

8. The multi-role intelligent sound box companion system according to claim 5, wherein after the script is downloaded to the intelligent sound box, the intelligent sound box parses, configures and plays it, with the following specific steps:

step T1: decompressing the file;

step T2: reading the configuration file and checking the integrity of the files; if they are correct, executing step T3; otherwise playing an error prompt tone asking the user to download again;

step T3: reading the script content;

step T4: judging whether intelligent dialogue is needed; if so, generating the intelligent dialogue according to the script content and then executing step T5; if not, directly executing step T5;

step T5: generating a play time sequence and a role distribution scheme according to the script content, and generating audio files from the script content according to the user configuration; while generating the audio files, the intelligent sound box sets the sound source characteristics, volume, playing speed, background music, special-effect music and similar properties of the audio according to the user configuration;

step T6: checking whether the audio files, the play time sequence file and the role distribution scheme are complete; if so, executing step T7; otherwise identifying the missing files and regenerating them;

step T7: distributing the audio files to the non-intelligent sound boxes according to the role distribution scheme;

step T8: controlling the non-intelligent sound boxes in turn to play the audio files according to the play time sequence;

step T9: confirming whether playback of the script has finished, and playing a user prompt tone once it has.

Technical Field

The invention belongs to the technical field of intelligent sound boxes, and particularly relates to a multi-role intelligent sound box companion system.

Background

With the development of artificial intelligence technology, more and more intelligent products are entering our lives. In recent years the field of intelligent sound boxes has developed rapidly and has become increasingly popular with the public. A typical intelligent sound box is a stand-alone unit. Although multiple intelligent sound boxes can be interconnected, or an intelligent sound box can be interconnected with other intelligent devices, using multiple intelligent products greatly increases cost, and users cannot fully customize the conversation scripts among those products. Patents in this area include publication No. CN105975622A, "Method and system for multi-role intelligent chatting", and publication No. CN108924033A, "Method and system for social intelligent speaker interaction with multi-role participation".

Some story machines also offer role playing, but they do so by playing different pre-recorded audio clips in time sequence, so the story line cannot be customized and intelligent conversation is not possible. Related patents include publication No. CN11047923A, "Story machine control method, story playback system and storage medium", and publication No. CN 201984649U, a story machine capable of playing the roles of a story book.

The multi-role intelligent sound box companion system differs substantially from an ordinary intelligent sound box. The system consists of one intelligent sound box and one or more ordinary sound boxes, which reduces cost compared with a system of several interconnected intelligent sound boxes. Unlike a traditional story machine, the invention allows the story line to be customized, supports role playing, and brings in the logic capability of the intelligent sound box, so that the scripted story line and intelligent dialogue can be performed together. The system can therefore meet the needs of scenarios such as daily entertainment, script rehearsal and children's education, and can effectively solve the problems described above.

Disclosure of Invention

The invention aims to provide a multi-role intelligent sound box companion system in which one intelligent sound box is connected to multiple ordinary sound boxes, the script is customized on the terminal equipment, and the intelligent sound box generates a play time sequence and role distribution scheme from the script content, thereby solving the problem that multi-role playback with existing intelligent devices is costly.

In order to solve the technical problems, the invention is realized by the following technical scheme:

The invention provides a multi-role intelligent sound box companion system comprising a background server, an intelligent sound box and terminal equipment, which are in communication connection with one another through the Internet. The intelligent sound box is in data connection with a plurality of non-intelligent sound boxes and is used for importing a script through a mobile storage device or the terminal equipment. The terminal equipment comprises a mobile terminal and a fixed terminal and is used for compiling and storing the script; through its built-in software, the terminal equipment performs role setting, script compiling, role voice configuration, background music setting and intelligent conversation setting. The background server provides resources and services for the intelligent sound box and the terminal equipment.

Preferably, the communication connection between the intelligent sound box and the plurality of non-intelligent sound boxes includes but is not limited to Bluetooth, Wi-Fi and data lines; the modes by which the intelligent sound box accesses the Internet include but are not limited to wireless network access point networking, wired broadband networking and mobile information system networking.

Preferably, the mobile terminal includes, but is not limited to, a mobile phone, a tablet computer, a palm computer; the fixed terminal includes, but is not limited to, a personal computer, a server, and a virtual machine.

Preferably, the mobile terminal can be connected directly to the intelligent sound box when no Internet access is available, and the connection modes include but are not limited to Bluetooth, USB, infrared and data line.

Preferably, the process by which the terminal equipment compiles and imports the script is as follows:

step S1: establishing connections between the intelligent sound box and the plurality of non-intelligent sound boxes; if Internet access is available, connecting the intelligent sound box to the Internet; otherwise, connecting the intelligent sound box to the terminal through Bluetooth, USB or another mode;

step S2: starting the companion software on the terminal equipment; the intelligent sound box actively reports its own state information, and all connected intelligent and non-intelligent sound boxes are shown in the companion software;

step S3: selecting a new script in the companion software to enter the script compiling page; on this page, the user can add script roles, compile script dialogue, and select background music and sound effects together with the script passages to which they apply;

step S4: after the script is compiled, the user can choose to store it locally or download it directly to the intelligent sound box; once the choice is made, the software packs the resources required for playing the script and carries out the chosen operation.

Preferably, in step S3, the sound source corresponding to a script role, and the speaking speed and volume of that role, can be configured manually when the role is added.

Preferably, in step S4, the resources include but are not limited to a user configuration file, a marked script file, and the resource files required for playing the script.

Preferably, after the script is downloaded to the intelligent sound box, the intelligent sound box parses, configures and plays it, with the following specific steps:

step T1: decompressing the file;

step T2: reading the configuration file and checking the integrity of the files; if they are correct, executing step T3; otherwise playing an error prompt tone asking the user to download again;

step T3: reading the script content;

step T4: judging whether intelligent dialogue is needed; if so, generating the intelligent dialogue according to the script content and then executing step T5; if not, directly executing step T5;

step T5: generating a play time sequence and a role distribution scheme according to the script content, and generating audio files from the script content according to the user configuration; while generating the audio files, the intelligent sound box sets the sound source characteristics, volume, playing speed, background music, special-effect music and similar properties of the audio according to the user configuration;

step T6: checking whether the audio files, the play time sequence file and the role distribution scheme are complete; if so, executing step T7; otherwise identifying the missing files and regenerating them;

step T7: distributing the audio files to the non-intelligent sound boxes according to the role distribution scheme;

step T8: controlling the non-intelligent sound boxes in turn to play the audio files according to the play time sequence;

step T9: confirming whether playback of the script has finished, and playing a user prompt tone once it has.

The invention has the following beneficial effects:

(1) one intelligent sound box is connected to multiple ordinary sound boxes, the script is customized on the terminal equipment, and the intelligent sound box generates the play time sequence and role distribution scheme from the script content, so that cost is reduced by avoiding the use of multiple intelligent sound boxes;

(2) the invention addresses the problem of customizing the story line on an intelligent sound box: the system supports importing a user-defined script, and script customization includes but is not limited to script content editing, role voice selection, speech rate configuration, music and sound effect configuration and intelligent dialogue configuration; after import, the system automatically converts the text of the script into voices corresponding to the different roles, so that the story line and the intelligent dialogue can be performed conveniently and quickly;

(3) the intelligent sound box supports intelligent conversation: when sentences or roles needing intelligent conversation are marked in the script, the system automatically generates the dialogue and converts it into voice; drawing on the network resources of the intelligent sound box, the system supports automatic search of story resources published on the network and can also obtain story resources from the background server, which greatly widens the sources of scripts and meets the needs of scenarios such as daily entertainment, script rehearsal and children's education.

Of course, it is not necessary for any product in which the invention is practiced to achieve all of the above-described advantages at the same time.

Drawings

In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort.

FIG. 1 is a flowchart of the intelligent sound box parsing, configuring and playing a script;

FIG. 2 is a schematic structural diagram of the multi-role intelligent sound box companion system with Internet access;

FIG. 3 is a schematic structural diagram of the multi-role intelligent sound box companion system without Internet access;

FIG. 4 is a flowchart of script compiling and importing on the terminal equipment.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to FIGS. 2-3, the present invention is a multi-role intelligent sound box companion system comprising a background server, an intelligent sound box and terminal equipment, which are in communication connection with one another through the Internet.

The intelligent sound box is in data connection with a plurality of non-intelligent sound boxes, which reduces cost. The intelligent sound box is used for importing a script through a mobile storage device or the terminal equipment. It should be noted that the intelligent sound box may import the script through a mobile storage device: the user edits the script on the terminal equipment without connecting to the intelligent sound box, saves the edited script to the mobile storage device, removes the device, and then plugs it into the intelligent sound box to import the script. In this case the software uses its cached configuration of the intelligent sound box, that is, the configuration information recorded the last time the terminal was connected to the intelligent sound box. This is an alternative way of importing a script.

The terminal equipment comprises a mobile terminal and a fixed terminal; the terminal equipment is used for compiling and storing the script; through its built-in software, the terminal equipment performs role setting, script compiling, role voice configuration, background music setting and intelligent conversation setting. It should be noted that only when the terminal can access the Internet and reach the background server can the user select, in the terminal software, script resources provided by the server or resources published on the network for download.

The background server is used for providing resources and services for the intelligent sound box and the terminal equipment.

The communication connection between the intelligent sound box and the plurality of non-intelligent sound boxes includes but is not limited to Bluetooth, Wi-Fi and data lines; the modes by which the intelligent sound box accesses the Internet include but are not limited to wireless network access point networking, wired broadband networking and mobile information system networking.

The mobile terminal includes but is not limited to a mobile phone, a tablet computer and a palm computer; fixed terminals include, but are not limited to, personal computers, servers, and virtual machines.

The mobile terminal can be connected directly to the intelligent sound box when no Internet access is available, and the connection modes include but are not limited to Bluetooth, USB, infrared and data line.
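
The choice between the Internet path and a direct link, and its effect on which script sources the terminal software can offer, can be sketched in Python as follows. This is only an illustration under stated assumptions: the names `Link`, `Environment`, `choose_link` and `script_sources` are hypothetical and do not appear in the patent, and picking the first available direct link is an arbitrary policy chosen for the example.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Link(Enum):
    INTERNET = "internet"
    BLUETOOTH = "bluetooth"
    USB = "usb"
    INFRARED = "infrared"
    DATA_LINE = "data line"


@dataclass
class Environment:
    internet_available: bool
    direct_links: List[Link]  # direct links supported by both terminal and sound box


def choose_link(env: Environment) -> Link:
    """Use the Internet when available, otherwise fall back to a direct link."""
    if env.internet_available:
        return Link.INTERNET
    if not env.direct_links:
        raise RuntimeError("no way to reach the intelligent sound box")
    # Assumption: take the first listed direct link; the patent does not rank them.
    return env.direct_links[0]


def script_sources(env: Environment) -> List[str]:
    """Server and public network scripts are offered only when online."""
    sources = ["local"]
    if env.internet_available:
        sources += ["background_server", "public_network"]
    return sources
```

For example, `choose_link(Environment(False, [Link.USB]))` selects the USB link, and `script_sources` for the same environment offers only locally written scripts.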

Referring to FIG. 4, the process by which the terminal equipment compiles and imports the script is as follows:

step S1: establishing connections between the intelligent sound box and the plurality of non-intelligent sound boxes; if Internet access is available, connecting the intelligent sound box to the Internet; otherwise, connecting the intelligent sound box to the terminal through Bluetooth, USB or another mode;

step S2: starting the companion software on the terminal equipment; the intelligent sound box actively reports its own state information, and all connected intelligent and non-intelligent sound boxes are shown in the companion software;

step S3: selecting a new script in the companion software to enter the script compiling page; on this page, the user can add script roles, compile script dialogue, and select background music and sound effects together with the script passages to which they apply;

step S4: after the script is compiled, the user can choose to store it locally or download it directly to the intelligent sound box; once the choice is made, the software packs the resources required for playing the script and carries out the chosen operation.

In step S3, the sound source corresponding to a script role, and the speaking speed and volume of that role, can be configured manually when the role is added.

In step S4, the resources include but are not limited to a user configuration file, a marked script file, and the resource files required for playing the script.
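
As an illustration of steps S3 and S4, the following Python sketch models a script with per-role voice settings and packs it together with its resource files. The patent does not specify any file format, so everything here is an assumption made for the example: the zip container, the file names `script.json` and `config.json`, the SHA-256 digests in the configuration file, and the names `Role`, `Line`, `Script` and `pack_script`.

```python
import hashlib
import io
import json
import zipfile
from dataclasses import asdict, dataclass, field
from typing import Dict, List


@dataclass
class Role:
    name: str
    voice: str           # sound source used for this role
    speed: float = 1.0   # speaking speed multiplier
    volume: float = 1.0


@dataclass
class Line:
    role: str
    text: str
    smart: bool = False  # True marks the line for intelligent dialogue


@dataclass
class Script:
    title: str
    roles: List[Role]
    lines: List[Line]
    background_music: List[str] = field(default_factory=list)


def pack_script(script: Script, resources: Dict[str, bytes]) -> bytes:
    """Pack the marked script, a configuration file and resources into one archive."""
    script_bytes = json.dumps(asdict(script), ensure_ascii=False).encode("utf-8")
    files = {"script.json": script_bytes, **resources}
    # Assumed configuration file: every packed file with its SHA-256 digest,
    # so the receiving side can check integrity after decompressing.
    manifest = {name: hashlib.sha256(data).hexdigest() for name, data in files.items()}
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("config.json", json.dumps(manifest))
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()
```

The returned bytes could then be written to local storage or sent to the intelligent sound box, matching the two choices offered to the user in step S4.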

Referring to FIG. 1, after the script is downloaded to the intelligent sound box, the intelligent sound box parses, configures and plays it, with the following steps:

step T1: decompressing the file;

step T2: reading the configuration file and checking the integrity of the files; if they are correct, executing step T3; otherwise playing an error prompt tone asking the user to download again;

step T3: reading the script content;

step T4: judging whether intelligent dialogue is needed; if so, generating the intelligent dialogue according to the script content and then executing step T5; if not, directly executing step T5;

step T5: generating a play time sequence and a role distribution scheme according to the script content, and generating audio files from the script content according to the user configuration; while generating the audio files, the intelligent sound box sets the sound source characteristics, volume, playing speed, background music, special-effect music and similar properties of the audio according to the user configuration;

step T6: checking whether the audio files, the play time sequence file and the role distribution scheme are complete; if so, executing step T7; otherwise identifying the missing files and regenerating them;

step T7: distributing the audio files to the non-intelligent sound boxes according to the role distribution scheme;

step T8: controlling the non-intelligent sound boxes in turn to play the audio files according to the play time sequence;

step T9: confirming whether playback of the script has finished, and playing a user prompt tone once it has.
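
The flow of steps T1 to T8 can be sketched in Python as below. This is a minimal sketch under stated assumptions, not the patented implementation: the archive is assumed to be a zip file holding a `script.json` and a `config.json` that maps file names to SHA-256 digests; round-robin role assignment is an illustrative policy the patent does not prescribe; and `reply`, `synthesize` and `send` are hypothetical callbacks standing in for the intelligent dialogue service, speech synthesis and the link to each non-intelligent sound box.

```python
import hashlib
import io
import json
import zipfile
from typing import Callable, Dict, List, Tuple

Clip = Tuple[int, str, bytes]  # (play order, target sound box, audio data)


def load_package(blob: bytes) -> dict:
    """T1-T3: decompress, verify files against the configuration, read the script."""
    with zipfile.ZipFile(io.BytesIO(blob)) as zf:
        manifest = json.loads(zf.read("config.json"))
        names = set(zf.namelist())
        for name, digest in manifest.items():
            if name not in names or hashlib.sha256(zf.read(name)).hexdigest() != digest:
                # The caller would play the error prompt tone here.
                raise ValueError(f"missing or corrupt file: {name}")
        return json.loads(zf.read("script.json"))


def assign_roles(roles: List[str], speakers: List[str]) -> Dict[str, str]:
    """T5: map roles to sound boxes, wrapping around if roles outnumber them."""
    if not speakers:
        raise ValueError("no non-intelligent sound box connected")
    return {role: speakers[i % len(speakers)] for i, role in enumerate(roles)}


def build_schedule(script: dict, assignment: Dict[str, str],
                   reply: Callable[[str], str],
                   synthesize: Callable[[str, str], bytes]) -> List[Clip]:
    """T4-T6: resolve marked lines, generate audio, and check nothing is empty."""
    schedule: List[Clip] = []
    for index, line in enumerate(script["lines"]):
        text = reply(line["text"]) if line.get("smart") else line["text"]
        audio = synthesize(line["role"], text)
        if not audio:
            raise ValueError(f"audio missing for line {index}")
        schedule.append((index, assignment[line["role"]], audio))
    return schedule


def play(schedule: List[Clip], send: Callable[[str, bytes], None]) -> None:
    """T7-T8: hand each clip to its sound box in play order."""
    for _, speaker, audio in sorted(schedule, key=lambda clip: clip[0]):
        send(speaker, audio)
```

A caller would chain these as `load_package`, `assign_roles`, `build_schedule` and `play`, then play the completion prompt tone of step T9 once `play` returns.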

It should be noted that, in the above system embodiment, each included unit is only divided according to functional logic, but is not limited to the above division as long as the corresponding function can be implemented; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.

In addition, it is understood by those skilled in the art that all or part of the steps in the method for implementing the embodiments described above may be implemented by a program instructing associated hardware, and the corresponding program may be stored in a computer-readable storage medium.

The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise embodiments disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.
