This guide describes how to obtain and install binary packages for the Speechmatics Speech Recognition (SR) plugin to the UniMRCP server on Debian-based Linux distributions. The document is intended for system administrators and developers.
Instructions provided in this guide are applicable to the following versions.
UniMRCP 1.7.0 and above
UniMRCP Speechmatics SR Plugin 1.0.0 and above
UniMRCP deb packages are currently available for x86_64 (64-bit) architecture only.
Operating System | Released | End of Support |
---|---|---|
Ubuntu 20.04 LTS (focal) | June 2024 | TBA |
Ubuntu 22.04 LTS (jammy) | June 2024 | TBA |
Packages for other distributions can be developed and made available in the scope of an annual maintenance agreement.
UniMRCP binary packages are available to authenticated users only. In order to register a free account, please visit the following page.
A newly registered account needs to be verified by the user and then activated by the administrator prior further proceeding.
Account verification and activation.
The account credentials can be used to log in to the following websites.
Using the APT package handling utility (apt-get) is recommended for installation of UniMRCP binary packages.
Supply login information by creating a file /etc/apt/auth.conf.d/unimrcp.conf containing the following entry.
machine unimrcp.org
login username
password password
The username and password fields must be replaced with the corresponding account credentials.
Account credentials.
Configure a repository by creating a file /etc/apt/sources.list.d/unimrcp.list containing the following entry.
deb [arch=amd64] https://unimrcp.org/repo/apt/ distr main
The distr field must be replaced with the corresponding distribution code name such as xenial, bionic, focal, etc. To determine the distribution code, use lsb_release -cs
.
Distribution code name.
For verification of binary packages, UniMRCP provides a public GnuPG key, which can be retrieved and installed as follows.
wget -O - https://unimrcp.org/keys/unimrcp-gpg-key.public | sudo apt-key add -
In order to check for updates and apply the changes in the APT configuration, use the following command.
sudo apt-get update
In order to install the Speechmatics SR plugin, including all the dependencies, use the following command.
sudo apt-get install unimrcp-speechmatics-sr
As a result, apt-get will check and prompt to download all the required packages by installing them in the directory /opt/unimrcp.
In order to install the additional data files for the sample client application umc, the following command can be used.
sudo apt-get install umc-addons
This package is optional and provides additional data to be used for validation of basic setup.
UniMRCP deb packages can be installed manually using the dpkg utility. Note, however, that the system administrator should take care of package dependencies and install all the packages in appropriate order.
The deb packages have the following naming convention:
$packagename_$universion-$distr_$arch.deb
where
packagename is the name of a package
universion is the UniMRCP version
distr is the distribution code name (bionic, focal, …)
arch is the architecture (amd64, i386, all, …)
The following is a list of UniMRCP deb packages required for the installation of the Speechmatics SR plugin.
Package Name | Description |
---|---|
unimrcp-speechmatics-sr | Speechmatics SR plugin to the server. |
unilibevent | UniMRCP edition of the libevent library. |
umc-addons | Sample en-US data files used with umc. [Optional] |
unilicnodegen | Node information retrieval tool, required for license deployment. |
unimrcp-server | Shared library and application of the server. |
unimrcp-client | Shared libraries and sample applications of the client. [Optional] |
unimrcp-demo-plugins | Set of demo plugins to the server. [Optional] |
unimrcp-common | Data common for the client and the server. |
uniapr | UniMRCP edition of the Apache Portable Runtime (APR) library. |
uniapr-util | UniMRCP edition of the Apache Portable Runtime Utility (APR-Util) library. |
unisofia-sip | UniMRCP edition of the Sofia SIP library. |
Packages for APR, APR-Util and Sofia-SIP libraries must be installed first.
sudo dpkg --install uniapr_$aprversion-$distr_$arch.deb
sudo dpkg --install uniapr-util_$apuversion-$distr_$arch.deb
sudo dpkg --install unisofia-sip_$sofiaversion-$distr_$arch.deb
Then, a package containing common data for the client and the server, and a package for the server should follow.
sudo dpkg --install unimrcp-common_$universion-$distr_$arch.deb
sudo dpkg --install unimrcp-server_$universion-$distr_$arch.deb
Next, a package containing the utility tool unilicnodegen, required for license deployment.
sudo dpkg --install unilicnodegen_$toolversion-$distr_$arch.deb
Finally, a package containing the Speechmatics SR plugin should follow.
sudo dpkg --install unimrcp-speechmatics-sr_$universion-$distr_all.deb
The Speechmatics SR plugin to the UniMRCP server is a licensed software.
In order to obtain a trial license, node information must be retrieved and submitted for license generation.
Use the installed tool unilicnodegen to retrieve the node information.
sudo /opt/unimrcp/bin/unilicnodegen
As a result, a text file uninode.info will be saved in the current directory.
Consider the following procedure to place an order.
- Navigate to https://unispeech.io/smsr.
- Under the section Obtain License, select
- License Variation: Trial
- License Term: 30-day
- License Type: Node-bound
- License Quantity: 2
- Click Add to cart and then View cart.
- Click Proceed to checkout and then Place order.
- Attach the retrieved uninode.info file to the placed order.
Ther orders are normally processed within one business day. A license file will be provided as a result.
The license file needs to be placed into the directory /opt/unimrcp/data.
sudo cp umsspeechmatics_*.lic /opt/unimrcp/data
In order to utilize the Speechmatics Speech-to-Text API, corresponding credentials need to be obtained from Speechmatics and deployed to the UniMRCP server.
Obtain the following parameters associated to your Speechmatics account:
Create a text file speechmatics.subscription.key in the directory /opt/unimrcp/data.
sudo nano /opt/unimrcp/data/speechmatics.subscription.key
Place the obtained parameters in the following JSON format.
{
"auth-key": "*****,
"service-endpoint": "*****"
}
In order to load the Speechmatics SR plugin into the UniMRCP server, open the file unimrcpserver.xml, located in the directory /opt/unimrcp/conf, and add the following entry under the XML element <plugin-factory>. Disable other recognition plugins, if available. The demo plugins for other resources can also be disabled.
<!-- Factory of plugins (MRCP engines) -->
<plugin-factory>
<engine id="Demo-Recog-1" name="demorecog" enable="false"/>
<engine id="Speechmatics-SR-1" name="umsspeechmatics" enable="true"/>
</plugin-factory>
In order to enable log output from the plugin and set filtering rules, open the configuration file logger.xml, located in the directory /opt/unimrcp/conf, and add the following entry under the element <sources>.
<source name="SPEECHMATICS-PLUGIN" priority="INFO" masking="NONE"/>
The configuration file of the plugin is located in /opt/unimrcp/conf/umsspeechmatics.xml. Default settings should be sufficient for the general use.
Validate your setup by using the sample UniMRCP client and server applications on the same host. The default configuration and data files should be sufficient for a basic test.
Launch the UniMRCP server application.
cd /opt/unimrcp/bin
sudo ./unimrcpserver
In the server log output, check whether the plugin is normally loaded.
[INFO] Load Plugin [Speechmatics-SR-1] [/opt/unimrcp/plugin/umspeechmatics.so]
Next, check for the license information.
[NOTICE] UniMRCP Speechmatics License
-product name: umsspeechmatics
-product version: 1.0.0
-license owner: -
-license type: trial
-issue date: 2024-05-30
-exp date: 2024-06-29
-channel count: 2
-feature set: 0
The optional package umc-addons must be installed for this test to work.
Launch the sample UniMRCP client application umc.
cd /opt/unimrcp/bin
./umc
Run a typical speech recognition scenario by issuing the command run tsr1 from the console of the umc client application.
run tsr1
This command sends a RECOGNIZE request to the server and then starts streaming a sample audio input file bookroom.pcm to recognize.
Check for the NLSML results to be returned as expected.
<?xml version="1.0"?>
<result>
<interpretation grammar="builtin:speech/transcribe" confidence="1">
<instance>book a room</instance>
<input mode="speech">book a room</input>
</interpretation>
</result>
Visually inspect the log output for any possible warnings or errors.
Note that utterances are stored in the var directory, if the corresponding parameter is enabled in the configuration file umsspeechmatics.xml and/or requested by the client.