Audio playing method, audio playing device and audio playing system

Document No.: 1941844 Publication date: 2021-12-07

Reading note: This technology, "Audio playing method, audio playing device and audio playing system", was designed by 孙宇鹏 on 2020-08-11. Its main content is as follows: The invention discloses an audio playing method, an audio playing device and an audio playing system, and relates to the technical field of smart speakers. One embodiment of the method comprises: receiving a user utterance and uploading the user utterance to a server; receiving an audio resource and a voice prompt issued by the server; announcing the voice prompt and uploading a voice prompt announcement end message to the server; and receiving an image resource issued by the server, and transmitting the image resource to an image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously. This embodiment can solve the technical problem that a screenless speaker cannot perform visual interaction, or that a speaker with a screen is limited by its screen size.

1. An audio playing method, comprising:

receiving a user utterance, and uploading the user utterance to a server;

receiving audio data and a voice prompt issued by the server;

announcing the voice prompt, and uploading a voice prompt announcement end message to the server;

and receiving an image resource issued by the server, and transmitting the image resource to an image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously.

2. The method of claim 1, wherein transmitting the image resource to an image projection device comprises:

encoding the image resource using a binary protocol;

and transmitting the encoded image resource to the image projection device through a serial port.

3. The method of claim 1, wherein the audio data comprises an identifier of each audio resource;

transmitting the image resource to an image projection device while the audio data is being processed comprises:

acquiring a uniform resource locator of the audio resource according to the identifier of the audio resource, and uploading the uniform resource locator of the audio resource to the server;

and receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to the image projection device while the audio resource is playing.

4. The method of claim 1, wherein the audio data comprises an identifier of each audio resource and its corresponding uniform resource locator;

transmitting the image resource to an image projection device while the audio data is being processed comprises:

receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to the image projection device while the audio resource is playing.

5. The method of claim 1, wherein the image resource comprises at least one of text, a picture, and a video; and/or the image projection device is a projection camera.

6. An audio playback apparatus, comprising:

a user module, configured to receive a user utterance and upload the user utterance to a server;

a receiving module, configured to receive audio data and a voice prompt issued by the server;

an announcing module, configured to announce the voice prompt and upload a voice prompt announcement end message to the server;

and a projection module, configured to receive an image resource issued by the server and transmit the image resource to an image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously.

7. An audio playback system, comprising: a speaker and an image projection device;

the speaker is configured to receive a user utterance and upload the user utterance to a server; receive audio data and a voice prompt issued by the server; announce the voice prompt, and upload a voice prompt announcement end message to the server; and receive an image resource issued by the server, and transmit the image resource to the image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously;

the image projection device is configured to receive the image resource transmitted by the speaker and project the image resource.

8. The audio playback system of claim 7, wherein the image projection device is further configured to:

receive, through a serial port, the encoded image resource transmitted by the speaker;

and decode the encoded image resource using a binary protocol, and project the decoded image resource.

9. An electronic device, comprising:

one or more processors;

a storage device for storing one or more programs,

wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method of any of claims 1-5.

10. A computer-readable medium, on which a computer program is stored, which, when being executed by a processor, carries out the method according to any one of claims 1-5.

Technical Field

The invention relates to the technical field of smart speakers, and in particular to an audio playing method, an audio playing device and an audio playing system.

Background

Smart speakers are increasingly popular and bring convenience to daily life. Interaction between people and smart speakers is no longer limited to voice: people also want to interact with smart hardware visually and by touch.

In the process of implementing the invention, the inventor found that at least the following problems exist in the prior art:

Speakers currently on the market fall into two kinds: those with a screen and those without. A screenless speaker can interact with people only by voice and cannot interact with the user visually. A speaker with a screen lets people interact with smart hardware visually and by touch, but it is considerably more expensive and is also limited by its screen size.

Disclosure of Invention

In view of this, embodiments of the present invention provide an audio playing method, an audio playing apparatus, and an audio playing system, so as to solve the technical problem that a screenless speaker cannot perform visual interaction, or that a speaker with a screen is limited by its screen size.

To achieve the above object, according to an aspect of an embodiment of the present invention, there is provided an audio playing method including:

receiving a user utterance, and uploading the user utterance to a server;

receiving audio data and a voice prompt issued by the server;

announcing the voice prompt, and uploading a voice prompt announcement end message to the server;

and receiving an image resource issued by the server, and transmitting the image resource to an image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously.

Optionally, transmitting the image resource to an image projection device includes:

encoding the image resource using a binary protocol;

and transmitting the encoded image resource to the image projection device through a serial port.

Optionally, the audio data includes an identifier of each audio resource;

transmitting the image resource to an image projection device while the audio data is being processed includes:

acquiring a uniform resource locator of the audio resource according to the identifier of the audio resource, and uploading the uniform resource locator of the audio resource to the server;

and receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to the image projection device while the audio resource is playing.

Optionally, the audio data includes an identifier of each audio resource and its corresponding uniform resource locator;

transmitting the image resource to an image projection device while the audio data is being processed includes:

receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to the image projection device while the audio resource is playing.

Optionally, the image resource includes at least one of text, a picture and a video; and/or the image projection device is a projection camera.

In addition, according to another aspect of the embodiments of the present invention, there is provided an audio playback apparatus including:

a user module, configured to receive a user utterance and upload the user utterance to a server;

a receiving module, configured to receive audio data and a voice prompt issued by the server;

an announcing module, configured to announce the voice prompt and upload a voice prompt announcement end message to the server;

and a projection module, configured to receive an image resource issued by the server and transmit the image resource to an image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously.

Optionally, the projection module is further configured to:

encode the image resource using a binary protocol;

and transmit the encoded image resource to the image projection device through a serial port.

Optionally, the audio data includes an identifier of each audio resource;

the projection module is further configured to: acquire a uniform resource locator of the audio resource according to the identifier of the audio resource, and upload the uniform resource locator of the audio resource to the server;

and receive an audio playing instruction issued by the server, acquire and play the audio resource according to the uniform resource locator of the audio resource, and transmit the image resource to the image projection device while the audio resource is playing.

Optionally, the audio data includes an identifier of each audio resource and its corresponding uniform resource locator;

the projection module is further configured to: receive an audio playing instruction issued by the server, acquire and play the audio resource according to the uniform resource locator of the audio resource, and transmit the image resource to the image projection device while the audio resource is playing.

Optionally, the image resource includes at least one of text, a picture and a video; and/or the image projection device is a projection camera.

In addition, according to another aspect of the embodiments of the present invention, there is provided an audio playing system including: a speaker and an image projection device;

the speaker is configured to receive a user utterance and upload the user utterance to a server; receive audio data and a voice prompt issued by the server; announce the voice prompt, and upload a voice prompt announcement end message to the server; and receive an image resource issued by the server, and transmit the image resource to the image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously;

the image projection device is configured to receive the image resource transmitted by the speaker and project the image resource.

Optionally, the image projection device is further configured to:

receive, through a serial port, the encoded image resource transmitted by the speaker;

and decode the encoded image resource using a binary protocol, and project the decoded image resource.

Optionally, the speaker is further configured to:

encode the image resource using a binary protocol;

and transmit the encoded image resource to the image projection device through a serial port.

Optionally, the audio data includes an identifier of each audio resource;

the speaker is further configured to:

acquire a uniform resource locator of the audio resource according to the identifier of the audio resource, and upload the uniform resource locator of the audio resource to the server;

and receive an audio playing instruction issued by the server, acquire and play the audio resource according to the uniform resource locator of the audio resource, and transmit the image resource to the image projection device while the audio resource is playing.

Optionally, the audio data includes an identifier of each audio resource and its corresponding uniform resource locator;

the speaker is further configured to:

receive an audio playing instruction issued by the server, acquire and play the audio resource according to the uniform resource locator of the audio resource, and transmit the image resource to the image projection device while the audio resource is playing.

Optionally, the image resource includes at least one of text, a picture and a video; and/or the image projection device is a projection camera.

According to another aspect of the embodiments of the present invention, there is also provided an electronic device, including:

one or more processors;

a storage device for storing one or more programs,

when the one or more programs are executed by the one or more processors, the one or more processors implement the method of any of the embodiments described above.

According to another aspect of the embodiments of the present invention, there is also provided a computer readable medium, on which a computer program is stored, which when executed by a processor implements the method of any of the above embodiments.

One embodiment of the above invention has the following advantages or benefits: because the image resource is transmitted to the image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously, the technical problem in the prior art that a screenless speaker cannot perform visual interaction, or that a speaker with a screen is limited by its screen size, is solved. In the embodiment of the invention, the image resource is projected by the image projection device, which makes the interaction between the screenless speaker and the user more vivid, can reduce the cost of the smart hardware, and removes the need to consider screen size limits. The embodiment of the invention enriches the scenarios and content for which the user can use the smart speaker, so that a screenless speaker can play not only audio resources but also image resources such as text, pictures and videos.

Further effects of the non-conventional optional approaches mentioned above are described below in connection with the specific embodiments.

Drawings

The drawings are included to provide a better understanding of the invention and are not to be construed as unduly limiting the invention. Wherein:

Fig. 1 is a schematic diagram of the main flow of an audio playing method according to an embodiment of the present invention;

Fig. 2 is a schematic diagram of the main flow of an audio playing method according to a reference embodiment of the present invention;

Fig. 3 is a schematic diagram of the main flow of an audio playing method according to another reference embodiment of the present invention;

Fig. 4 is a schematic diagram of the main modules of an audio playing apparatus according to an embodiment of the present invention;

Fig. 5 is an exemplary system architecture diagram to which embodiments of the present invention may be applied;

Fig. 6 is a schematic block diagram of a computer system suitable for implementing a terminal device or server of an embodiment of the invention.

Detailed Description

Exemplary embodiments of the present invention are described below with reference to the accompanying drawings, in which various details of embodiments of the invention are included to assist understanding, and which are to be considered as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

Fig. 1 is a schematic diagram of a main flow of an audio playing method according to an embodiment of the present invention. As an embodiment of the present invention, as shown in fig. 1, the audio playing method may include:

Step 101: receive a user utterance and upload the user utterance to a server.

The speaker receives and recognizes the user utterance, is woken up, and plays a friendly greeting prompt; the speaker then uploads the user utterance to the server. The server receives the user utterance uploaded by the speaker and then searches for the corresponding audio resource, voice prompt and image resource according to the user utterance. Optionally, the user utterance may be text or audio, which is not limited in this embodiment of the present invention.

Optionally, searching for the corresponding audio resource, voice prompt and image resource according to the user utterance includes: parsing the user utterance to identify the user's intent and slot value; and matching the corresponding audio resource, voice prompt and image resource according to the intent and the slot value. For example, if the user utterance is "I want to listen to a song of xxx", the identified intent is "listen to a song" and the slot value is "xxx", so the corresponding audio resource, voice prompt and image resource can be matched according to the identification result. It should be noted that the user may configure the "song of xxx" in advance, and may also configure the image resource corresponding to the "song of xxx".
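As an illustration of the intent and slot matching described above, the following is a minimal sketch rather than the method the embodiment prescribes. It assumes a rule-based parser; the pattern table `INTENT_PATTERNS`, the catalogue `CATALOGUE`, and every identifier and file name in it are hypothetical and do not come from the source.

```python
import re
from typing import NamedTuple, Optional


class ParsedUtterance(NamedTuple):
    intent: str
    slot: str


# Hypothetical pattern table: one regular expression per intent, with the
# slot value captured in a named group.
INTENT_PATTERNS = {
    "listen_to_song": re.compile(
        r"^i want to listen to a song of (?P<slot>.+)$", re.IGNORECASE
    ),
}

# Hypothetical catalogue configured in advance, keyed by (intent, slot value).
CATALOGUE = {
    ("listen_to_song", "xxx"): {
        "audio_id": "audio-001",
        "voice_prompt": "Now playing a song of xxx",
        "image_resource": "cover-xxx.png",
    },
}


def parse_utterance(text: str) -> Optional[ParsedUtterance]:
    """Return the intent and slot value, or None if no pattern matches."""
    cleaned = text.strip()
    for intent, pattern in INTENT_PATTERNS.items():
        match = pattern.match(cleaned)
        if match:
            return ParsedUtterance(intent, match.group("slot").strip())
    return None


def match_resources(text: str) -> Optional[dict]:
    """Look up the resources for an utterance; None means an empty match."""
    parsed = parse_utterance(text)
    if parsed is None:
        return None
    return CATALOGUE.get((parsed.intent, parsed.slot))
```

A `None` result here stands for the empty matching result, for which the server would issue a failure prompt instead of resources.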

Optionally, the image resource includes at least one of text, a picture and a video. The text may be lyrics, an advertising slogan, a logo, a promotional slogan, a prompt, and the like; the picture may be a song cover or any picture configured by the user; similarly, the video may also be configured by the user according to actual needs.

In the embodiment of the invention, the playing state of the speaker is synchronized to the server, and the server can issue different voice prompts according to the current playing state of the speaker, for example to tell the user that playback of the song of xxx has stopped or that playback of the song of xxx has started, which improves the user experience.
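One way to picture the state-dependent prompts is the sketch below. It assumes the server keeps the last reported state per speaker; `PlaybackRegistry`, `choose_voice_prompt`, the two states and the prompt wording are all hypothetical names chosen for illustration.

```python
from enum import Enum
from typing import Dict


class PlaybackState(Enum):
    IDLE = "idle"
    PLAYING = "playing"


class PlaybackRegistry:
    """Server-side record of the last playing state reported by each speaker."""

    def __init__(self) -> None:
        self._states: Dict[str, PlaybackState] = {}

    def sync(self, speaker_id: str, state: PlaybackState) -> None:
        self._states[speaker_id] = state

    def state_of(self, speaker_id: str) -> PlaybackState:
        # A speaker that has never reported a state is treated as idle.
        return self._states.get(speaker_id, PlaybackState.IDLE)


def choose_voice_prompt(state: PlaybackState, action: str, title: str) -> str:
    """Pick a prompt from the requested action and the current playing state."""
    if action == "play":
        return f"Starting to play {title}"
    if action == "stop" and state is PlaybackState.PLAYING:
        return f"Stopped playing {title}"
    return "Nothing is playing right now"
```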

Step 102: receive the audio data and the voice prompt issued by the server.

After matching the audio resource, voice prompt and image resource corresponding to the user utterance, the server first issues the audio data and the voice prompt to the speaker, and the speaker receives them. The audio data includes the audio resource itself, an identifier of the audio resource, or an identifier of the audio resource together with its corresponding uniform resource locator.

It should be noted that, if the matching result is empty, the server also issues a corresponding voice prompt to the smart terminal to tell the user that the search failed.

Step 103: announce the voice prompt and upload a voice prompt announcement end message to the server.

After receiving the audio data and the voice prompt issued by the server, the speaker starts to announce the voice prompt. Because the playing state of the smart terminal is synchronized to the server, the speaker uploads the voice prompt announcement end message to the server after it finishes announcing the voice prompt, and the server synchronizes the state.

Step 104: receive the image resource issued by the server, and transmit the image resource to an image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously.

Optionally, the audio data includes an identifier of each audio resource. Transmitting the image resource to an image projection device while the audio data is being processed includes: acquiring a uniform resource locator of the audio resource according to the identifier of the audio resource, and uploading the uniform resource locator of the audio resource to the server; and receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to the image projection device while the audio resource is playing. To speed up the delivery of resources, in an embodiment of the present invention, after matching the resources the server first issues the ID of the audio resource and the voice prompt to the speaker, so that the speaker can start loading the audio while announcing the voice prompt. While loading the resource, the speaker acquires the URL of the audio resource according to its ID (the URL needs to be acquired from another server) and synchronizes the URL to the server; the server then issues an audio playing instruction, and the speaker acquires and plays the audio resource according to the URL.

Optionally, the audio data includes an identifier of each audio resource and its corresponding uniform resource locator. It should be noted that the identifiers and the uniform resource locators of the audio resources correspond one to one: each audio resource has one identifier and one corresponding uniform resource locator. Transmitting the image resource to an image projection device while the audio data is being processed includes: receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to the image projection device while the audio resource is playing. In this embodiment, the server stores the URL corresponding to the audio ID, so the URL can be issued to the speaker together with the ID, which lets the speaker obtain the audio resource more quickly.

Optionally, the audio data includes the audio resources themselves. In this embodiment, the server stores the audio resource, so the audio resource can be issued directly to the smart terminal, and the smart terminal skips the step of loading the audio resource according to the URL.
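The three forms of audio data (identifier only, identifier plus URL, or the resource itself) could be handled on the speaker side roughly as follows. This is only a sketch under assumptions: the `AudioData` container and the `resolve_url` and `fetch` callables are hypothetical stand-ins, and the URL upload and the wait for the server's playing instruction are left out for brevity.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class AudioData:
    """Hypothetical container for the audio data issued by the server."""
    audio_id: Optional[str] = None
    url: Optional[str] = None
    content: Optional[bytes] = None


def load_audio(
    data: AudioData,
    resolve_url: Callable[[str], str],
    fetch: Callable[[str], bytes],
) -> bytes:
    """Return the audio bytes, doing only the work the payload still requires."""
    # Case 3: the server issued the resource itself, so nothing is loaded.
    if data.content is not None:
        return data.content
    url = data.url
    # Case 1: only the ID was issued, so the URL is resolved elsewhere first.
    if url is None:
        if data.audio_id is None:
            raise ValueError("audio data carries no ID, URL or content")
        url = resolve_url(data.audio_id)
    # Cases 1 and 2: download the resource from its URL.
    return fetch(url)
```

The fewer lookups the speaker has to perform, the sooner playback can start, which is the trade-off the three variants above describe.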

After issuing the audio playing instruction to the speaker, the server goes on to issue the image resource to the speaker. After receiving the image resource issued by the server, the speaker transmits the image resource to the image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously. Optionally, the image projection device is a projection camera, and the image resource transmitted by the speaker can be projected through the image projection device onto any surface, such as a wall, the floor or a projection screen, without being limited by screen size.

The speaker provides local controls such as previous track and next track, plays the corresponding audio resource, and synchronizes the playing state to the server.
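The local controls might look like the sketch below, which assumes a simple in-memory track list that wraps around at both ends. `LocalPlayer`, the `report_state` callback and the shape of the reported message are hypothetical; the source does not specify how the state is reported.

```python
from typing import Callable, Dict, List


class LocalPlayer:
    """Speaker-side track list with previous/next controls."""

    def __init__(
        self,
        track_ids: List[str],
        report_state: Callable[[Dict[str, str]], None],
    ) -> None:
        if not track_ids:
            raise ValueError("at least one track is required")
        self._tracks = list(track_ids)
        self._index = 0
        self._report = report_state

    @property
    def current(self) -> str:
        return self._tracks[self._index]

    def _switch(self, step: int) -> str:
        # Wrap around at either end of the list (an assumption of this sketch).
        self._index = (self._index + step) % len(self._tracks)
        # Synchronize the new playing state to the server after each change.
        self._report({"track_id": self.current, "state": "playing"})
        return self.current

    def next_track(self) -> str:
        return self._switch(1)

    def previous_track(self) -> str:
        return self._switch(-1)
```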

The speaker can transmit audio and video resources to the image projection device over WIFI, Bluetooth, a network cable, a serial port, and the like. Optionally, transmitting the image resource to the image projection device includes: encoding the image resource using a binary protocol (such as the lua protocol); and transmitting the encoded image resource to the image projection device through a serial port. Because a serial port transmits binary data, the embodiment of the invention converts the image resource into binary data for serial communication, which achieves plug and play.
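The source does not give the frame layout of its binary protocol, so the following sketch invents one purely for illustration: a two-byte magic marker, a one-byte resource type, a four-byte big-endian payload length, the payload, and a one-byte additive checksum. All constants and function names are hypothetical, and an in-memory `io.BytesIO` buffer stands in for the serial port (any object with a `write` method would do).

```python
import io
import struct

# Hypothetical frame layout, not taken from the source:
# magic (2 bytes) | resource type (1 byte) | payload length (4 bytes, big-endian)
# | payload | checksum (1 byte, sum of payload bytes modulo 256)
MAGIC = b"\xAA\x55"
HEADER = struct.Struct(">2sBI")
TYPE_TEXT, TYPE_PICTURE, TYPE_VIDEO = 1, 2, 3


def encode_frame(resource_type: int, payload: bytes) -> bytes:
    """Wrap an image resource in the hypothetical binary frame."""
    header = HEADER.pack(MAGIC, resource_type, len(payload))
    checksum = sum(payload) % 256
    return header + payload + bytes([checksum])


def send_frame(port, resource_type: int, payload: bytes) -> int:
    """Write one frame to a port-like object and return the frame size."""
    frame = encode_frame(resource_type, payload)
    port.write(frame)
    return len(frame)


if __name__ == "__main__":
    fake_port = io.BytesIO()  # stand-in for a real serial port
    sent = send_frame(fake_port, TYPE_TEXT, "lyrics line".encode("utf-8"))
    print(sent, fake_port.getvalue().hex())
```

A length prefix lets the receiver know where a frame ends on a byte stream, and the checksum gives a cheap integrity check; a real design would likely use a stronger check such as a CRC.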

It can be seen from the embodiments described above that, because the image resource is transmitted to the image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously, the technical problem in the prior art that a screenless speaker cannot perform visual interaction, or that a speaker with a screen is limited by its screen size, is solved. In the embodiment of the invention, the image projection device makes the interaction between the screenless speaker and the user more vivid, can reduce the cost of the smart hardware, and removes the need to consider screen size limits. The embodiment of the invention enriches the scenarios and content for which the user can use the smart speaker, so that a screenless speaker can play not only audio resources but also image resources such as text, pictures and videos.

Fig. 2 is a schematic diagram of a main flow of an audio playing method according to a reference embodiment of the present invention. As still another embodiment of the present invention, as shown in fig. 2, the audio playing method may include:

Step 201: receive a user utterance and upload the user utterance to a server.

The speaker receives and recognizes the user utterance, is woken up, and plays a friendly greeting prompt; the speaker then uploads the user utterance to the server. The server receives the user utterance uploaded by the speaker and then searches for the corresponding audio resource, voice prompt and image resource according to the user utterance.

Step 202: receive the identifier of the audio resource and the voice prompt issued by the server.

After matching the audio resource, voice prompt and image resource corresponding to the user utterance, the server first issues the ID of the audio resource and the voice prompt to the speaker, and the speaker receives them. If the matching result is empty, the server issues a corresponding voice prompt to the smart terminal to tell the user that the search failed.

Step 203: announce the voice prompt and upload a voice prompt announcement end message to the server.

After receiving the ID of the audio resource and the voice prompt issued by the server, the speaker starts to announce the voice prompt. Because the playing state of the smart terminal is synchronized to the server, the speaker uploads the voice prompt announcement end message to the server after it finishes announcing the voice prompt, and the server synchronizes the state.

Step 204: acquire the uniform resource locator of the audio resource according to the identifier of the audio resource, and upload the uniform resource locator of the audio resource to the server.

Generally, the URL of the audio resource is stored on another server, so the speaker needs to obtain the URL from the server that stores it according to the ID of the audio resource, and then synchronize the URL to the server.

Step 205: receive an audio playing instruction issued by the server, and acquire and play the audio resource according to the uniform resource locator of the audio resource.

After receiving the URL reported by the speaker, the server issues an audio playing instruction, and the speaker acquires and plays the audio resource according to the URL of the audio resource.

Step 206: receive the image resource issued by the server, and transmit the image resource to an image projection device while the audio resource is playing, so that the image projection device projects the image resource synchronously.

After issuing the audio playing instruction to the speaker, the server goes on to issue the image resource to the speaker. After receiving the image resource issued by the server, the speaker transmits the image resource to the image projection device while the audio resource is playing, so that the image projection device projects the image resource synchronously. Optionally, the image projection device is a projection camera, and the image resource transmitted by the speaker can be projected through the image projection device onto any surface, such as a wall, the floor or a projection screen, without being limited by screen size.
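The ordering of steps 201 to 206 can be sketched from the speaker's point of view as below. This is a simplified illustration under assumptions: `FakeServer` and its method names are hypothetical, the server's pushes are modelled as return values of calls, and `play_audio` is assumed to start playback without blocking so that projection happens while the audio plays.

```python
from typing import Callable, List


class FakeServer:
    """Hypothetical in-memory stand-in for the server; it records each exchange."""

    def __init__(self) -> None:
        self.log: List[str] = []

    def upload_utterance(self, text: str) -> dict:
        self.log.append("utterance")
        return {"audio_id": "audio-001", "voice_prompt": "Now playing a song of xxx"}

    def report_prompt_finished(self) -> None:
        self.log.append("prompt_end")

    def report_url(self, url: str) -> str:
        self.log.append("url:" + url)
        return "play"  # the audio playing instruction

    def next_image_resource(self) -> bytes:
        self.log.append("image")
        return b"lyrics line 1"


def run_flow(
    utterance: str,
    server: FakeServer,
    resolve_url: Callable[[str], str],
    announce: Callable[[str], None],
    play_audio: Callable[[str], None],
    project: Callable[[bytes], None],
) -> bool:
    reply = server.upload_utterance(utterance)      # steps 201 and 202
    announce(reply["voice_prompt"])                 # step 203
    server.report_prompt_finished()
    url = resolve_url(reply["audio_id"])            # step 204
    instruction = server.report_url(url)
    if instruction != "play":
        return False
    play_audio(url)                                 # step 205 (non-blocking start)
    project(server.next_image_resource())           # step 206
    return True
```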

In addition, the specific implementation of this reference embodiment has already been described in detail in the audio playing method above, and is therefore not repeated here.

Fig. 3 is a schematic diagram of a main flow of an audio playing method according to another referential embodiment of the present invention. As another embodiment of the present invention, as shown in fig. 3, the audio playing method may include:

the user sends out a speaking command, the sound box receives the user, and then the user speaking is uploaded to the server side.

The server end receives the user dialect uploaded by the radio box, and then searches corresponding audio resources, voice dialect and image resources according to the user dialect.

And after matching the audio resource, the voice technology and the image resource corresponding to the user technology, the server side issues the ID of the audio resource and the voice technology to the sound box.

And after the sound box receives the ID of the audio data sent by the server and the voice operation, starting to broadcast the voice operation, and after the broadcast is finished, uploading the message of finishing the voice operation playing to the server so as to synchronize the playing state.

And the sound box acquires the URL according to the audio resource ID, and then reports the URL of the audio resource to the server side so as to synchronize the playing state.

And the server sends an audio playing instruction to the sound box, and the sound box acquires and plays the audio resource according to the URL of the audio resource. Meanwhile, the server sends the image resources to the sound box, and after the sound box receives the image resources, the image resources are coded by adopting a binary protocol (such as a lua protocol), and then the coded image resources are transmitted to the image projection equipment through a serial port.

The image projection equipment receives the encoded image resources transmitted by the sound box, then decodes the encoded image resources by adopting a binary protocol, and finally projects the decoded image resources.
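The ordering of messages in the flow above can be made concrete with a small simulation. This is only a sketch: the catalogue, the ID-to-URL table, the message names, and the `example.com` URL are hypothetical placeholders, and no real network or serial transport is involved.

```python
from dataclasses import dataclass


@dataclass
class Match:
    audio_id: str
    voice_prompt: str
    image_resource: str


# Hypothetical server-side catalogue keyed by user utterance.
CATALOG = {
    "play a lullaby": Match("audio-001", "Playing a lullaby for you.", "lullaby_lyrics.txt"),
}

# Hypothetical table the sound box uses to turn an audio resource ID into a URL.
URLS = {"audio-001": "https://example.com/audio/lullaby.mp3"}


def run_flow(utterance):
    """Return the ordered (route, message kind, payload) steps of the flow."""
    trace = [("sound box -> server", "utterance", utterance)]
    match = CATALOG[utterance]  # server searches for matching resources
    trace.append(("server -> sound box", "audio_id_and_prompt",
                  (match.audio_id, match.voice_prompt)))
    trace.append(("sound box", "broadcast_prompt", match.voice_prompt))
    trace.append(("sound box -> server", "prompt_finished", None))
    url = URLS[match.audio_id]  # sound box resolves the URL from the ID
    trace.append(("sound box -> server", "audio_url", url))
    trace.append(("server -> sound box", "play_audio", None))
    trace.append(("server -> sound box", "image_resource", match.image_resource))
    trace.append(("sound box -> projector", "encoded_image", match.image_resource))
    return trace
```

Calling `run_flow("play a lullaby")` yields eight steps, starting with the utterance upload and ending with the encoded image being handed to the projection device.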

In addition, the detailed implementation of the audio playing method in this embodiment has been described above and is not repeated here.

Fig. 4 is a schematic diagram of the main modules of an audio playing apparatus according to an embodiment of the present invention. As shown in Fig. 4, the audio playing apparatus 400 includes a user module 401, a receiving module 402, a broadcasting module 403, and a projection module 404. The user module 401 is configured to receive a user utterance and upload the user utterance to a server; the receiving module 402 is configured to receive audio data and a voice prompt issued by the server; the broadcasting module 403 is configured to broadcast the voice prompt and upload a voice prompt broadcast end message to the server; the projection module 404 is configured to receive an image resource issued by the server and transmit the image resource to an image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously.
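One way to picture the four modules is as methods of a single class with its external links injected. The sketch below is an assumption about structure only: the `upload`, `speak`, and `send_to_projector` callables and the message kinds are hypothetical and are not defined by the text.

```python
class AudioPlayingApparatus:
    """Sketch of apparatus 400 with one method per module 401-404."""

    def __init__(self, upload, speak, send_to_projector):
        self.upload = upload                        # hypothetical: upload(kind, payload)
        self.speak = speak                          # hypothetical text-to-speech hook
        self.send_to_projector = send_to_projector  # hypothetical projector link
        self.audio_data = None
        self.voice_prompt = None

    def user_module(self, utterance):
        """Module 401: receive the user utterance and upload it."""
        self.upload("utterance", utterance)

    def receiving_module(self, audio_data, voice_prompt):
        """Module 402: receive the audio data and the voice prompt."""
        self.audio_data = audio_data
        self.voice_prompt = voice_prompt

    def broadcast_module(self):
        """Module 403: broadcast the prompt, then report that it ended."""
        self.speak(self.voice_prompt)
        self.upload("prompt_finished", None)

    def projection_module(self, image_resource):
        """Module 404: forward the image resource to the projection device."""
        if self.audio_data is None:
            raise RuntimeError("no audio data received yet")
        self.send_to_projector(image_resource)
```

Injecting the three callables keeps the sketch independent of any particular network, speech, or serial library.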

Optionally, the projection module 404 is further configured to:

encoding the image resource by adopting a binary protocol;

and transmitting the encoded image resource to image projection equipment through a serial port.

Optionally, the audio data includes an identifier of each audio resource;

the projection module 404 is further configured to:

acquiring a uniform resource locator of the audio resource according to the identifier of the audio resource, and uploading the uniform resource locator of the audio resource to the server;

and receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to image projection equipment in the process of playing the audio resource.

Optionally, the audio data includes an identifier of each audio resource and a corresponding uniform resource locator thereof;

the projection module 404 is further configured to:

and receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to image projection equipment in the process of playing the audio resource.
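The two optional variants differ only in where the uniform resource locator comes from: either the audio data already carries it, or the device looks it up from the identifier and uploads it to the server. A minimal sketch, assuming the audio data is a dictionary with an `"id"` key and an optional `"url"` key, and that `lookup_url` and `report_url` are hypothetical callables:

```python
def resolve_audio_url(audio_data, lookup_url, report_url):
    """Return the URL to play, looking it up and reporting it if absent."""
    audio_id = audio_data["id"]
    url = audio_data.get("url")
    if url is None:
        # Variant 1: only the identifier was issued, so resolve the URL
        # locally and upload it to the server to synchronize state.
        url = lookup_url(audio_id)
        report_url(audio_id, url)
    # Variant 2: the server already supplied the URL; nothing to report.
    return url
```

In both variants playback itself still waits for the audio playing instruction from the server.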

Optionally, the image resource includes at least one of text, a picture, and a video; and/or the image projection device is a projection camera.

As can be seen from the embodiments described above, by transmitting the image resource to the image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously, the invention solves the technical problem in the prior art that a screen-less sound box cannot perform visual interaction and that a sound box with a screen is limited by its screen size. According to the embodiments of the invention, the image projection device makes interaction between the screen-less sound box and the user more vivid, the cost of the intelligent hardware can be reduced, and the limitation of screen size need not be considered. The embodiments of the invention enrich the scenarios and content available to users of the intelligent sound box, so that a screen-less sound box can play audio resources and also present image resources such as text, pictures, and videos.

It should be noted that the implementation details of the audio playing apparatus of the present invention correspond to those of the audio playing method described above and are therefore not repeated here.

The invention also provides an audio playing system, which includes a sound box and an image projection device. The sound box is configured to: receive a user utterance and upload the user utterance to a server; receive audio data and a voice prompt issued by the server; broadcast the voice prompt and upload a voice prompt broadcast end message to the server; and receive an image resource issued by the server and transmit the image resource to the image projection device while processing the audio data, so that the image projection device projects the image resource synchronously. The image projection device is configured to receive the image resource transmitted by the sound box and to project the image resource.

Optionally, the image projection apparatus is further configured to:

receiving the encoded image resource transmitted by the sound box through a serial port;

and decoding the encoded image resources by adopting a binary protocol, and projecting the decoded image resources.

Optionally, the sound box is further configured to:

encoding the image resource by adopting a binary protocol;

and transmitting the encoded image resource to image projection equipment through a serial port.
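The text does not specify the binary protocol, so the following is only an illustrative sketch of how a sound box might frame an image resource for a serial link and how the image projection device might decode it. The frame layout (magic byte, type code, length, payload, CRC-32), the type codes, and the function names are all assumptions; an in-memory `io.BytesIO` stream stands in for the serial port.

```python
import io
import struct
import zlib

HEADER = struct.Struct(">BBI")   # magic byte, type code, payload length
MAGIC = 0xA5                     # assumed frame marker
TYPES = {"text": 1, "picture": 2, "video": 3}   # assumed type codes
NAMES = {code: name for name, code in TYPES.items()}


def encode_frame(kind, payload):
    """Sound box side: wrap one image resource in a checksummed frame."""
    header = HEADER.pack(MAGIC, TYPES[kind], len(payload))
    crc = struct.pack(">I", zlib.crc32(header + payload))
    return header + payload + crc


def read_exact(stream, n):
    """Read exactly n bytes or fail, as a serial reader would need to."""
    data = stream.read(n)
    if len(data) != n:
        raise EOFError("stream ended mid-frame")
    return data


def decode_frame(stream):
    """Projection device side: parse and verify one frame from the stream."""
    header = read_exact(stream, HEADER.size)
    magic, type_code, length = HEADER.unpack(header)
    if magic != MAGIC:
        raise ValueError("bad magic byte")
    payload = read_exact(stream, length)
    (crc,) = struct.unpack(">I", read_exact(stream, 4))
    if crc != zlib.crc32(header + payload):
        raise ValueError("checksum mismatch")
    return NAMES[type_code], payload
```

For example, `decode_frame(io.BytesIO(encode_frame("text", b"hello")))` round-trips the payload. A length prefix plus checksum is a common choice on byte-oriented serial links because the receiver has no other way to find frame boundaries or detect corruption.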

Optionally, the audio data includes an identifier of each audio resource;

the sound box is further configured to:

acquiring a uniform resource locator of the audio resource according to the identifier of the audio resource, and uploading the uniform resource locator of the audio resource to the server;

and receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to image projection equipment in the process of playing the audio resource.

Optionally, the audio data includes an identifier of each audio resource and a corresponding uniform resource locator thereof;

the sound box is further configured to:

and receiving an audio playing instruction issued by the server, acquiring and playing the audio resource according to the uniform resource locator of the audio resource, and transmitting the image resource to image projection equipment in the process of playing the audio resource.

Optionally, the image resource includes at least one of text, a picture, and a video; and/or the image projection device is a projection camera.

Fig. 5 shows an exemplary system architecture 500 to which the audio playing method or audio playing apparatus of the embodiments of the present invention may be applied.

As shown in fig. 5, the system architecture 500 may include terminal devices 501, 502, 503, a network 504, and a server 505. The network 504 serves to provide a medium for communication links between the terminal devices 501, 502, 503 and the server 505. Network 504 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.

The user may use the terminal devices 501, 502, 503 to interact with a server 505 over a network 504 to receive or send messages or the like. The terminal devices 501, 502, 503 may have installed thereon various communication client applications, such as shopping-like applications, web browser applications, search-like applications, instant messaging tools, mailbox clients, social platform software, etc. (by way of example only).

The terminal devices 501, 502, 503 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and the like.

The server 505 may be a server providing various services, such as a background management server (for example only) providing support for shopping websites browsed by users using the terminal devices 501, 502, 503. The background management server may analyze and otherwise process the received data such as the item information query request, and feed back a processing result (for example, target push information, item information — just an example) to the terminal device.

It should be noted that the audio playing method provided by the embodiment of the present invention is executed by the terminal devices 501, 502, and 503, and accordingly, the audio playing apparatus is disposed in the terminal devices 501, 502, and 503.

It should be understood that the number of terminal devices, networks, and servers in fig. 5 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.

Referring now to Fig. 6, a block diagram of a computer system 600 suitable for implementing a terminal device of an embodiment of the invention is shown. The terminal device shown in Fig. 6 is only an example and should not impose any limitation on the functions or the scope of use of the embodiments of the present invention.

As shown in Fig. 6, the computer system 600 includes a Central Processing Unit (CPU) 601 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 602 or a program loaded from a storage section 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data necessary for the operation of the system 600 are also stored. The CPU 601, ROM 602, and RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to the bus 604.

The following components are connected to the I/O interface 605: an input section 606 including a keyboard, a mouse, and the like; an output section 607 including a display such as a Cathode Ray Tube (CRT) or a Liquid Crystal Display (LCD), and a speaker; a storage section 608 including a hard disk and the like; and a communication section 609 including a network interface card such as a LAN card, a modem, or the like. The communication section 609 performs communication processing via a network such as the Internet. A drive 610 is also connected to the I/O interface 605 as needed. A removable medium 611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 610 as necessary, so that a computer program read from it is installed in the storage section 608 as necessary.

In particular, according to the embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for performing the method illustrated in the flowchart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 609, and/or installed from the removable medium 611. When executed by the Central Processing Unit (CPU) 601, the computer program performs the above-described functions defined in the system of the present invention.

It should be noted that the computer readable medium shown in the present invention can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present invention, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present invention, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.

The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer programs according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

The modules described in the embodiments of the present invention may be implemented by software or hardware. The described modules may also be provided in a processor, which may be described as: a processor includes a user module, a receiving module, a broadcast module, and a projection module, where the names of these modules do not in some cases constitute a limitation on the module itself.

As another aspect, the present invention also provides a computer-readable medium, which may be contained in the device described in the above embodiments, or may exist separately without being incorporated into the device. The computer-readable medium carries one or more programs which, when executed by a device, implement the following method: receiving a user utterance, and uploading the user utterance to a server; receiving audio data and a voice prompt issued by the server; broadcasting the voice prompt, and uploading a voice prompt broadcast end message to the server; and receiving an image resource issued by the server, and transmitting the image resource to an image projection device while processing the audio data, so that the image projection device projects the image resource synchronously.

According to the technical solution of the embodiments of the invention, the image resource is transmitted to the image projection device while the audio data is being processed, so that the image projection device projects the image resource synchronously. This solves the technical problem in the prior art that a screen-less sound box cannot perform visual interaction and that a sound box with a screen is limited by its screen size. According to the embodiments of the invention, the image projection device makes interaction between the screen-less sound box and the user more vivid, the cost of the intelligent hardware can be reduced, and the limitation of screen size need not be considered. The embodiments of the invention enrich the scenarios and content available to users of the intelligent sound box, so that a screen-less sound box can play audio resources and also present image resources such as text, pictures, and videos.

The above-described embodiments should not be construed as limiting the scope of the invention. Those skilled in the art will appreciate that various modifications, combinations, sub-combinations, and substitutions can occur, depending on design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
