KSpeech Interface Reference

kspeech - the KDE Text-to-Speech API.

#include <kspeech.h>


Public Types
enum	kttsdJobState { jsQueued = 0, jsSpeakable = 1, jsSpeaking = 2, jsPaused = 3, jsFinished = 4 }
enum	kttsdMarkupType { mtPlain = 0, mtJsml = 1, mtSsml = 2, mtSable = 3, mtHtml = 4 }
Public Attributes
k_dcop_signals	__pad0__: void ignoreThis()

Detailed Description

kspeech - the KDE Text-to-Speech API.

Version:: 1.0 Draft 10

Since:: KDE 3.4

This class defines the DCOP interface for applications desiring to speak text. Applications may speak text by sending DCOP messages to application "kttsd" object "KSpeech".

KTTSD -- the KDE Text-to-Speech Daemon -- is the program that supplies the services in the KDE Text-to-Speech API.

Warning:: The KSpeech interface is still being developed and is likely to change in the future.

Features

Priority system for Screen Readers, warnings and messages, while still playing regular texts.
Long text is parsed into sentences. User may back up by sentence or part, replay, pause, and stop playing.
Handles multiple speaking applications. Text messages are treated like print jobs. Jobs may be created, started, stopped, paused, resumed, and deleted.
Speak contents of clipboard.
Speak KDE notifications.
Plugin-based text job filtering permits substitution for misspoken words, abbreviations, etc., transformation of XML or XHTML to SSML, and automatic choice of appropriate synthesis engine.

Requirements

You may build any KDE application to use KSpeech, since the interface is in kdelibs, but the kdeaccessibility package must be installed for KTTS to function.

You will need a speech synthesis engine, such as Festival. See the KTTS Handbook for the latest information on installing and configuring speech engines and voices with KTTS.

Design Goals

The KDE Text-to-Speech API is designed with the following goals:

Support the features enumerated above.
Plugin-based architecture for support of a wide variety of speech synthesis engines and drivers.
Permit generation of speech from the command line (or via shell scripts) using the KDE DCOP utilities.
Provide a lightweight and easily usable interface for applications to generate speech output.
Applications need not be concerned about contention over the speech device.
Provide limited support for speech markup languages, such as Sable, Java Speech Markup Language (JSML), and Speech Markup Meta-language (SMML).
Provide limited support for embedded speech markers.
Asynchronous to prevent system blocking.
Plugin-based audio architecture. Currently supports aRts but will support additional audio engines in the future, such as gstreamer.
Compatible with original KTTSD API as developed by José Pablo Ezequiel "Pupeno" Fernández (avoid breaking existing applications).

Architecturally, applications interface with KTTSD, which performs queueing, speech job management, plugin management and sentence parsing. KTTSD interfaces with one or more KTTSD speech plugins, which in turn interface with the speech engine(s) or driver(s).

         application
              ^
              |  via DCOP (the KDE Text-to-Speech API)
              v
            kttsd
              ^
              |  KTTSD plugin API
              v
         kttsd plugin
              ^
              |
              v
        speech engine

The KTTSD Plugin API is documented in PluginConf in the kdeaccessibility module.

There is a separate GUI application, called kttsmgr, for providing KTTSD configuration and job management.

kttsd maintains 4 types of speech output:

Screen Reader Output
Warnings
Messages
Text Jobs

Method sayScreenReaderOutput speaks Screen Reader output. It pre-empts any other speech in progress, including other Screen Reader outputs, i.e., it is not a queue. This method is reserved for use by Screen Readers.

Methods sayWarning and sayMessage place messages into the Warnings and Messages queues respectively. Warnings take priority over messages, which take priority over text jobs. Warnings and messages are spoken when the currently-speaking sentence of a text job is finished.

setText places text into the text job queue. startText begins speaking jobs. When one job finishes, the next job begins. Method appendText adds additional parts to a text job. Within a text job, the application (and user via the kttsmgr GUI), may back up or advance by sentence or part, or rewind to the beginning. See jumpToTextPart and moveRelTextSentence. Text jobs may be paused, stopped, and resumed or deleted from the queue. See pauseText, stopText, resumeText, and removeText.
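The ordering among the three queued output types can be pictured with a small standalone model. This is only a sketch of the rule stated above, not KTTSD's implementation; the names SpeechQueues and nextToSpeak are hypothetical, and Screen Reader output is left out because it pre-empts other speech rather than waiting in a queue.

```cpp
#include <cassert>
#include <deque>
#include <string>

// Hypothetical model of the queued output types, for illustration only.
struct SpeechQueues {
    std::deque<std::string> warnings;
    std::deque<std::string> messages;
    std::deque<std::string> textSentences; // remaining sentences of the text job
};

// Pick what to speak once the current sentence has finished:
// warnings first, then messages, then the next text-job sentence.
// Returns an empty string when nothing is waiting.
std::string nextToSpeak(SpeechQueues &q)
{
    std::deque<std::string> *order[] = { &q.warnings, &q.messages, &q.textSentences };
    for (std::deque<std::string> *queue : order) {
        if (!queue->empty()) {
            std::string item = queue->front();
            queue->pop_front();
            return item;
        }
    }
    return std::string();
}
```

Calling nextToSpeak repeatedly drains all warnings before any message, and all messages before the text job continues.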

DCOP Command-line Interface

To create a text job to be spoken

     dcop kttsd KSpeech setText <text> <talker>

where <text> is the text to be spoken, and <talker> is usually a language code such as "en", "cy", etc.

Example.

     dcop kttsd KSpeech setText "This is a test." "en"

To start speaking the text.

     dcop kttsd KSpeech startText 0

You can combine the setText and startText commands into a single command.

     dcop kttsd KSpeech sayText <text> <talker>

Since:: KDE 3.5

To stop speaking and rewind to the beginning of the text.

     dcop kttsd KSpeech stopText 0

Depending upon the speech plugin used, speaking may not immediately stop.

To stop and remove a text job.

     dcop kttsd KSpeech removeText 0

Note: For more information about talker codes, see talkers below.

Calling KTTSD from a Program

There are two methods of making DCOP calls from your application to KTTSD.

Manually code them using dcopClient object. See kdebase/konqueror/kttsplugin/khtmlkttsd.cpp for an example. This method is recommended if you want to make a few simple calls to KTTSD.
Use kspeech_stub as described below. This method generates the marshalling code for you and is recommended for more complex speech-enabled applications. kcmkttsmgr in the kdeaccessibility module is an example that uses this method.

To make DCOP calls from your program using kspeech_stub, follow these steps:

1. Include kspeech_stub.h in your code. Derive an object from the KSpeech_stub interface. For example, suppose you are developing a KPart and want to call KTTSD. Your class declaration might look like this:

     #include <kspeech_stub.h>
     class MyPart: public KParts::ReadOnlyPart, public KSpeech_stub {

2. In your class constructor, initialize DCOPStub, giving it the sender "kttsd", object "KSpeech".

     MyPart::MyPart(QWidget *parent, const char *name) :
        KParts::ReadOnlyPart(parent, name),
        DCOPStub("kttsd", "KSpeech") {

3. See if KTTSD is running, and if not, start it.

     DCOPClient *client = dcopClient();
     client->attach();
     if (!client->isApplicationRegistered("kttsd")) {
         QString error;
         if (KApplication::startServiceByDesktopName("kttsd", QStringList(), &error))
             cout << "Starting KTTSD failed with message " << error << endl;
     }

If you want to detect if KTTSD is installed without starting it, use this code.

     KTrader::OfferList offers = KTrader::self()->query("DCOP/Text-to-Speech", "Name == 'KTTSD'");
     if (offers.count() > 0)
     {
       // KTTSD is installed.
     }

Typically, you would do this to hide a menu item or button if KTTSD is not installed.

4. Make calls to KTTSD in your code.

     uint jobNum = setText("Hello World", "en");
     startText(jobNum);

5. Add kspeech_DIR and kspeech.stub to your Makefile.am. Example:

     kspeech_DIR = $(kde_includes)
     libmypart_la_SOURCES = kspeech.stub

Signals Emitted by KTTSD

KTTSD emits a number of DCOP signals, which provide information about sentences spoken, text jobs started, stopped, paused, resumed, finished, or deleted and markers seen. In general, these signals are broadcast to any application that connects to them. Applications should check the appId argument to determine whether the signal belongs to them or not.

To receive KTTSD DCOP signals, follow these steps:

1. Include kspeechsink.h in your code. Derive an object from the KSpeechSink interface and declare a method for each signal you'd like to receive. For example, if you were coding a KPart and wanted to receive the KTTSD signal sentenceStarted:

     #include <kspeechsink.h>
     class MyPart:
         public KParts::ReadOnlyPart,
         virtual public KSpeechSink
     {
         protected:
            ASYNC sentenceStarted(const QCString& appId, const uint jobNum, const uint seq);

You can combine sending and receiving in one object.

     #include <kspeechsink.h>
     class MyPart:
         public KParts::ReadOnlyPart,
         public KSpeech_stub,
         virtual public KSpeechSink
     {
         protected:
            ASYNC sentenceStarted(const QCString& appId, const uint jobNum, const uint seq);

See below for the signals you can declare.

2. In your class constructor, initialize DCOPObject with the name of your DCOP receiving object.

     MyPart::MyPart(QWidget *parent, const char *name) :
         KParts::ReadOnlyPart(parent, name),
         DCOPObject("mypart_kspeechsink") {

Use any name you like.

3. Where appropriate (usually in your constructor), make sure your DCOPClient is registered and connect the KTTSD DCOP signals to your declared receiving methods.

     // Register DCOP client.
     DCOPClient *client = kapp->dcopClient();
     if (!client->isRegistered())
     {
         client->attach();
         client->registerAs(kapp->name());
     }
     // Connect KTTSD DCOP signals to our slots.
     connectDCOPSignal("kttsd", "KSpeech",
         "sentenceStarted(QCString,uint,uint)",
         "sentenceStarted(QCString,uint,uint)",
         false);

Notice that the argument signatures differ slightly from the actual declarations. For example

     ASYNC sentenceStarted(const QCString& appId, const uint jobNum, const uint seq);

becomes

       "sentenceStarted(QCString,uint,uint)",

in the connectDCOPSignal call.

4. Write the definition for the received signal. Be sure to check whether the signal is intended for your application.

     ASYNC MyPart::sentenceStarted(const QCString& appId, const uint jobNum, const uint seq)
     {
         // Check appId to determine if this is our signal.
         if (appId != dcopClient()->appId()) return;
         // Do something here.
     }

5. Add kspeechsink_DIR and kspeechsink.skel to your Makefile.am. Example for an app both sending and receiving.

     kspeech_DIR = $(kde_includes)
     kspeechsink_DIR = $(kde_includes)
     libmypart_la_SOURCES = kspeech.stub kspeechsink.skel

Talkers, Talker Codes, and Plugins

Many of the methods permit you to specify a desired "talker". This may be a simple language code, such as "en" for English, "es" for Spanish, etc. Code as NULL to use the default configured talker.

Within KTTSMGR, the user has the ability to configure more than one talker for each language, with different voices, genders, volumes, and talking speeds.

Talker codes serve two functions:

They identify configured plugins, and
They provide a way for applications to specify the desired speaking attributes that influence the choice of plugin to speak text.

A Talker Code consists of a series of XML tags and attributes. An example of a full Talker Code with all attributes specified is

   <voice lang="en" name="kal" gender="male"/>
   <prosody volume="soft" rate="fast"/>
   <kttsd synthesizer="Festival" />

(The voice and prosody tags are adapted from the W3C Speech Synthesis Markup Language (SSML) and Java Speech Markup Language (JSML). The kttsd tag is an extension to the SSML and JSML languages to support named synthesizers and text encodings.) KTTS doesn't really care about the voice, prosody, and kttsd tags. In fact, they may be omitted and just the attributes specified. The example above then becomes

lang="en" name="kal" gender="male" volume="soft" rate="fast" synthesizer="Festival"

The attributes may be specified in any order.

For clarity, the rest of the discussion will omit the voice, prosody, and kttsd tags.

The attributes that make up a talker code are:

lang. Language code and optional country code. Examples: en, es, en_US, en_GB. Codes are case-insensitive and hyphen (-) or underscore (_) may be used to separate the country code from the language code.
synthesizer. The name of the synthesizer (plugin) used to produce the speech.
gender. May be either "male", "female", or "neutral".
name. The name of the voice code. The choice of voice codes is synthesizer-specific.
volume. May be "loud", "medium", or "quiet". A synonym for "quiet" is "soft".
rate. May be "fast", "medium", or "slow".

Each plugin, once it has been configured by a user in kttsmgr, returns a fully-specified talker code to identify itself. If the plugin supports it, the user may configure another instance of the plugin with a different set of attributes. This is the difference between a "plugin" and a "talker". A talker is a configured instance of a plugin. Each plugin (if it supports it) may be configured as multiple talkers.

When the user configures KTTSD, she configures one or more talkers and then places them in preferred order, top to bottom in kttsmgr. In effect, she specifies her preferences for each of the talkers.

When applications specify a talker code, they need not (and typically do not) give a full specification. An example of a talker code with only some of the attributes specified might be

lang="en" gender="female"

If the talker code is not in XML attribute format, it is assumed to be a lang attribute. So the talker code

en

is interpreted as

lang="en"
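The two forms of talker code described above (attribute pairs, with or without the enclosing tags, and a bare language code) can be illustrated with a small standalone parser. This is a sketch under stated assumptions rather than KTTSD's own parser: parseTalkerCode and TalkerAttrs are hypothetical names, and standard C++ containers stand in for the Qt classes.

```cpp
#include <cassert>
#include <map>
#include <regex>
#include <string>

typedef std::map<std::string, std::string> TalkerAttrs;

// Hypothetical helper: extract attribute="value" pairs from a talker code.
// Any surrounding <voice>, <prosody>, or <kttsd> tags are simply skipped,
// since only the attributes matter.
TalkerAttrs parseTalkerCode(const std::string &code)
{
    TalkerAttrs attrs;
    static const std::regex pair("([A-Za-z]+)\\s*=\\s*\"([^\"]*)\"");
    for (std::sregex_iterator it(code.begin(), code.end(), pair), end; it != end; ++it)
        attrs[(*it)[1].str()] = (*it)[2].str();

    // A non-empty code with no attribute pairs is taken as a bare language code.
    if (attrs.empty() && !code.empty())
        attrs["lang"] = code;
    return attrs;
}
```

With this helper, the talker code en and the talker code lang="en" produce the same single-entry map.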

When a program requests a talker code in calls to setText, appendText, sayMessage, sayWarning, and sayScreenReaderOutput, KTTSD tries to match the requested talker code to the closest matching configured talker.

The lang attribute has highest priority (attempting to speak English with a Spanish synthesizer would likely be unintelligible). So the language attribute is said to have "priority". If an application does not specify a language attribute, a default one will be assumed. The rest of the attributes are said to be "preferred". If KTTSD cannot find a talker with the exact preferred attributes requested, the closest matching talker will likely still be understandable.

An application may specify that one or more of the attributes it gives in a talker code have priority by preceding each priority attribute with an asterisk. For example, the following talker code

lang="en" gender="*female" volume="soft"

means that the application wants to use a talker that supports the English language and Female gender. If there is more than one such talker, one that supports Soft volume would be preferred. Notice that a talker configured as English, Male, and Soft volume would not be picked as long as an English Female talker is available.

The algorithm used by KTTSD to find a matching talker is as follows:

If language code is not specified by the application, assume default configured by user. The primary language code automatically has priority.
(Note: This is not yet implemented.) If there are no talkers configured in the language, KTTSD will attempt to automatically configure one (see automatic configuration discussion below).
The talker that matches on the most priority attributes wins.
If a tie, the one that matches on the most preferred attributes wins.
If there is still a tie, the one nearest the top of the kttsmgr display (first configured) will be chosen.
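The selection steps above can be sketched as a scoring loop over the configured talkers. This is an illustrative model only, not the code KTTSD runs: chooseTalker and TalkerAttrs are hypothetical names, attribute values are compared as whole strings (so country codes are not split out), and the not-yet-implemented automatic configuration step is omitted.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

typedef std::map<std::string, std::string> TalkerAttrs;

// Hypothetical sketch of the selection rules. Values in `requested` may carry
// a leading '*' to give them priority; "lang" always has priority.
// `configured` is in kttsmgr display order (top first).
// Returns the index of the chosen talker, or -1 if none are configured.
int chooseTalker(TalkerAttrs requested,
                 const std::vector<TalkerAttrs> &configured,
                 const std::string &defaultLang)
{
    // Fall back to the user's default language when none was requested.
    if (requested.find("lang") == requested.end())
        requested["lang"] = defaultLang;

    int best = -1, bestPriority = -1, bestPreferred = -1;
    for (std::size_t i = 0; i < configured.size(); ++i) {
        int priority = 0, preferred = 0;
        for (TalkerAttrs::const_iterator r = requested.begin(); r != requested.end(); ++r) {
            std::string value = r->second;
            bool isPriority = (r->first == "lang");
            if (!value.empty() && value[0] == '*') {
                isPriority = true;
                value.erase(0, 1);
            }
            TalkerAttrs::const_iterator c = configured[i].find(r->first);
            if (c == configured[i].end() || c->second != value)
                continue;
            if (isPriority) ++priority; else ++preferred;
        }
        // Strict comparisons keep the earliest talker on a complete tie.
        if (priority > bestPriority ||
            (priority == bestPriority && preferred > bestPreferred)) {
            best = static_cast<int>(i);
            bestPriority = priority;
            bestPreferred = preferred;
        }
    }
    return best;
}
```

Because every configured talker receives a score, even a poor match is returned whenever at least one talker exists.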

Language codes actually consist of two parts, a language code and an optional country code. For example, en_GB is English (United Kingdom). The language code is treated as a priority attribute, but the country code (if specified) is treated as preferred. So for example, if an application requests the following talker code

lang="en_GB" gender="male" volume="medium"

then a talker configured as lang="en" gender="male" volume="medium" would be picked over one configured as lang="en_GB" gender="female" volume="soft", since the former matches on two preferred attributes and the latter only on the preferred attribute GB. An application can override this and make the country code priority with an asterisk. For example,

lang="*en_GB" gender="male" volume="medium"

To specify that American English is priority, put an asterisk in front of en_US, like this.

lang="*en_US" gender="male" volume="medium"

Here the application is indicating that a talker that speaks American English has priority over one that speaks a different form of English.
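Splitting a lang value into its language and country parts might look like the following. This is a minimal sketch, assuming the asterisk, separator, and case rules described in this section; LangCode and parseLang are hypothetical names and are not part of the KSpeech interface.

```cpp
#include <cassert>
#include <cctype>
#include <string>

// Hypothetical representation of a parsed lang attribute.
struct LangCode {
    std::string language;    // lower-cased, e.g. "en"
    std::string country;     // upper-cased, e.g. "GB"; empty if absent
    bool countryHasPriority; // true when the value began with '*'
};

LangCode parseLang(std::string value)
{
    LangCode result;
    result.countryHasPriority = false;
    if (!value.empty() && value[0] == '*') {
        result.countryHasPriority = true;
        value.erase(0, 1);
    }
    // Hyphen and underscore are both accepted as separators.
    std::string::size_type sep = value.find_first_of("-_");
    result.language = value.substr(0, sep);
    if (sep != std::string::npos)
        result.country = value.substr(sep + 1);
    // Codes are case-insensitive, so normalise before comparing.
    for (char &c : result.language)
        c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
    for (char &c : result.country)
        c = static_cast<char>(std::toupper(static_cast<unsigned char>(c)));
    return result;
}
```

A matcher built on this would count a language match as priority and a country match as preferred, unless countryHasPriority is set.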

(Note: Not yet implemented). If a language code is specified, and no plugin is currently configured with a matching language code, KTTSD will attempt to automatically load and configure a plugin to support the requested language. If there is no such plugin, or there is a plugin but it cannot automatically configure itself, KTTSD will pick one of the configured plugins using the algorithm given above.

Notice that KTTSD will always pick a talker, even if it is a terrible match. (The principle is that something heard is better than nothing at all. If it sounds terrible, user will change his configuration.) If an attribute is absolutely mandatory -- in other words the application must speak with the attribute or not at all -- the application can determine if there are any talkers configured with the attribute by calling getTalkers, and if there are none, display an error message to the user.

Applications can implement their own talker-matching algorithm by calling getTalkers, then finding the desired talker from the returned list. When the full talker code is passed in, KTTSD will find an exact match and use the specified talker.

If an application requires a configuration that user has not created, it should display a message to user instructing them to run kttsmgr and configure the desired talker. (This must be done interactively because plugins often need user assistance locating voice files, etc.)

The above scheme is designed to balance the needs of applications against user preferences. Applications are given the control they might need, without unnecessarily burdening the application author. If you are an application author, the above discussion might seem overly complicated. It isn't really all that complicated. Here are rules of thumb:

It is legitimate to give a NULL (0) talker code, in which case, the user's default talker will be used.
If you know the language code, give that in the talker code, otherwise leave it out.
If there is an attribute your application requires for proper functioning, specify that with an asterisk in front of it. For example, your app might speak in two different voices, Male and Female. (Since your app requires both genders, call getTalkers to determine if both genders are available, and if not, advise the user to configure them. Better yet, give the user a choice of available distinguishing attributes, such as loud/soft or fast/slow.)
If there are other attributes you would prefer, specify those without an asterisk, but leave them out if it doesn't really make any difference to proper functioning of your application. Let the user decide them when they configure KTTS.

One final note about talkers. KTTSD does talker matching for each sentence spoken, just before the sentence is sent to a plugin for synthesis. Therefore, the user can change the effective talker in mid processing of a text job by changing his preferences, or even deleting or adding new talkers to the configuration.

Speech Markup

Note: Speech Markup is not yet fully implemented in KTTSD.

Each of the five methods for queueing text to be spoken -- sayScreenReaderOutput, setText, appendText, sayMessage, and sayWarning -- may contain speech markup, provided that the plugin the user has configured supports that markup. The markup languages and plugins currently supported are:

Speech Synthesis Markup language (SSML): Festival and Hadifix.

This may change in the future as synthesizers improve.

Before including markup in the text sent to kttsd, the application should query whether the currently-configured plugin supports the markup language by calling supportsMarkup.

If it does not support the markup, it will be stripped out of the text.

Support for Markers

Note: Markers are not yet implemented in KTTSD.

When using a speech markup language, such as Sable, JSML, or SSML, the application may embed named markers into the text. If the user's chosen speech plugin supports markers, KTTSD will emit DCOP signal markerSeen when the speech engine encounters the marker. Depending upon the speech engine and plugin, this may occur either when the speech engine encounters the marker during synthesis from text to speech, or when the speech is actually spoken on the audio device. The calling application can call the supportsMarkers method to determine if the currently configured plugin supports markers or not.

Sentence Parsing

Not all speech engines provide robust capabilities for stopping synthesis that is in progress. To compensate for this, KTTSD parses text jobs given to it by the setText and appendText methods into sentences and sends the sentences to the speech plugin one at a time. In this way, should the user wish to stop the speech output, they can do so, and the worst that will happen is that the last sentence will be completed. This is called Sentence Boundary Detection (SBD).

Sentence Boundary Detection also permits the user to rewind by sentences.

The default sentence delimiter used for plain text is as follows:

A period (.), question mark (?), exclamation mark (!), colon (:), or semi-colon (;) followed by whitespace (including newline), or
Two newlines in a row separated by optional whitespace, or
The end of the text.
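The default plain-text delimiter rules listed above can be approximated by a short standalone splitter. This is a sketch of those three rules only, not the SBD filter shipped with KTTS; splitSentences and its helpers are hypothetical names, and standard C++ strings stand in for QString.

```cpp
#include <cassert>
#include <cctype>
#include <string>
#include <vector>

static bool isSpace(char c) { return std::isspace(static_cast<unsigned char>(c)) != 0; }

static std::string trim(const std::string &s)
{
    std::string::size_type b = 0, e = s.size();
    while (b < e && isSpace(s[b])) ++b;
    while (e > b && isSpace(s[e - 1])) --e;
    return s.substr(b, e - b);
}

// Hypothetical splitter following the default plain-text rules:
// punctuation followed by whitespace, a blank line, or end of text.
std::vector<std::string> splitSentences(const std::string &text)
{
    std::vector<std::string> sentences;
    std::string current;
    const std::string punctuation = ".?!:;";
    const std::string::size_type n = text.size();

    for (std::string::size_type i = 0; i < n; ++i) {
        const char c = text[i];
        bool boundary = false;
        if (punctuation.find(c) != std::string::npos && i + 1 < n && isSpace(text[i + 1])) {
            current += c;          // the punctuation stays with its sentence
            boundary = true;
        } else if (c == '\n') {
            // Look for a second newline, allowing other whitespace in between.
            std::string::size_type j = i + 1;
            while (j < n && text[j] != '\n' && isSpace(text[j])) ++j;
            if (j < n && text[j] == '\n')
                boundary = true;
            else
                current += c;
        } else {
            current += c;
        }
        if (boundary) {
            const std::string sentence = trim(current);
            if (!sentence.empty()) sentences.push_back(sentence);
            current.clear();
        }
    }
    // The end of the text closes the final sentence.
    const std::string last = trim(current);
    if (!last.empty()) sentences.push_back(last);
    return sentences;
}
```

Note that a period inside a number such as 3.14 does not end a sentence here, because it is not followed by whitespace.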

When given text containing speech markup, KTTSD automatically determines the markup type and parses based on the sentence semantics of the markup language.

An application may change the sentence delimiter by calling setSentenceDelimiter prior to calling setText. Changing the delimiter does not affect other applications.

Text given to KTTSD via the sayWarning, sayMessage, and sayScreenReaderOutput methods is not parsed into sentences. For this reason, applications should not send long messages with these methods.

Sentence Boundary Detection is implemented as a plugin SBD filter. See filters for more information.

Filters

Users may specify filters in the kttsmgr GUI. Filters are plugins that modify the text to be spoken or change other characteristics of jobs. Currently, the following filter plugins are available:

String Replacer. Permits users to substitute for misspoken words, or vocalize chat emoticons.
XML Transformer. Given a particular XML or XHTML format, permits conversion of the XML to SSML (Speech Synthesis Markup Language) using XSLT (Extensible Stylesheet Language Transformations) stylesheets.
Talker Chooser. Permits users to redirect jobs from one configured Talker to another based on the contents of the job or application that sent it.

Additional plugins may be available in the future.

In addition to these regular filters, KTTS also implements Sentence Boundary Detection (SBD) as a plugin filter. See sentenceparsing for more information.

Regular filters are applied to Warnings, Messages, and Text jobs. SBD filters are only applied to regular Text jobs; they are not applied to Warnings and Messages. Screen Reader Outputs are never filtered.

Authors

Author:

José Pablo Ezequiel "Pupeno" Fernández <pupeno@kde.org>

Gary Cramblitt <garycramblitt@comcast.net>

Olaf Schmidt <ojschmidt@kde.org>

Gunnar Schmi Dt <gunnar@schmi-dt.de>

Definition at line 646 of file kspeech.h.

Member Enumeration Documentation

enum KSpeech::kttsdJobState

Job states returned by method getTextJobState.

Enumerator:

jsQueued	Job has been queued but is not yet speakable.
jsSpeakable	Job is speakable, but is not speaking.
jsSpeaking	Job is currently speaking.
jsPaused	Job has been paused.
jsFinished	Job is finished and is deletable.

Definition at line 654 of file kspeech.h.
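As a small illustration, an application might turn the integer returned by getTextJobState into a readable description. The helper below is hypothetical and not part of the KSpeech interface; it mirrors the enumerator values locally so that it compiles without kspeech.h, and it treats -1 as the "invalid job number" result documented for getTextJobState.

```cpp
#include <cassert>
#include <string>

// Local mirror of the kttsdJobState values, so the sketch is standalone.
enum JobState { jsQueued = 0, jsSpeakable = 1, jsSpeaking = 2, jsPaused = 3, jsFinished = 4 };

// Hypothetical helper: describe the int returned by getTextJobState.
std::string describeJobState(int state)
{
    switch (state) {
    case jsQueued:    return "queued, not yet speakable";
    case jsSpeakable: return "speakable, not speaking";
    case jsSpeaking:  return "speaking";
    case jsPaused:    return "paused";
    case jsFinished:  return "finished, deletable";
    case -1:          return "no such job";
    default:          return "unknown state";
    }
}
```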

enum KSpeech::kttsdMarkupType

Speech markup language types.

Enumerator:

mtPlain	Plain text.
mtJsml	Java Speech Markup Language.
mtSsml	Speech Synthesis Markup Language.
mtSable	Sable 2.0.
mtHtml	HTML. Since: 3.5

Definition at line 667 of file kspeech.h.

Member Function Documentation

virtual int KSpeech::appendText	(	const QString &	text,
		uint	jobNum = `0`
	)			`[pure virtual]`

Adds another part to a text job.

Does not start speaking the text.

Parameters:

	text	The message to be spoken.
	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

Returns:: Part number for the added part. Parts are numbered starting at 1.

The text is parsed into individual sentences. Call getTextCount to retrieve the sentence count. Call startText to mark the job as speakable and if the job is the first speakable job in the queue, speaking will begin.

See also:: setText, startText.

virtual ASYNC KSpeech::changeTextTalker	(	const QString &	talker,
		uint	jobNum = `0`
	)			`[pure virtual]`

Change the talker for a text job.

Parameters:

	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).
	talker	New code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default talker. If no plugin has been configured for the specified Talker code, defaults to the closest matching talker.

virtual uint KSpeech::getCurrentTextJob ( ) [pure virtual]

Get the job number of the current text job.

Returns:: Job number of the current text job. 0 if no jobs.

Note that the current job may not be speaking. See isSpeakingText.

See also:: getTextJobState, isSpeakingText.

virtual QStringList KSpeech::getTalkers ( ) [pure virtual]

Get a list of the talkers configured in KTTS.

Returns:: A QStringList of fully-specified talker codes, one for each talker user has configured.

See also:: talkers

virtual int KSpeech::getTextCount ( uint jobNum = 0 ) [pure virtual]

Get the number of sentences in a text job.

Parameters:

	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

Returns:: The number of sentences in the job. -1 if no such job.

The sentences of a job are given sequence numbers from 1 to the number returned by this method. The sequence numbers are emitted in the sentenceStarted and sentenceFinished signals.

virtual uint KSpeech::getTextJobCount ( ) [pure virtual]

Get the number of jobs in the text job queue.

Returns:: Number of text jobs in the queue. 0 if none.

virtual QByteArray KSpeech::getTextJobInfo ( uint jobNum = 0 ) [pure virtual]

Get information about a text job.

Parameters:

	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

Returns:: A QByteArray holding a serialized QDataStream with information about the job. Blank if no such job.

The stream contains the following elements:

int state - Job state.
QCString appId - DCOP senderId of the application that requested the speech job.
QString talker - Talker Code requested by application.
int seq - Current sentence being spoken. Sentences are numbered starting at 1.
int sentenceCount - Total number of sentences in the job.
int partNum - Current part of the job being spoken. Parts are numbered starting at 1.
int partCount - Total number of parts in the job.

Note that sequence numbers apply to the entire job. They do not start from 1 at the beginning of each part.

The following sample code will decode the stream:

     QByteArray jobInfo = getTextJobInfo(jobNum);
     QDataStream stream(jobInfo, IO_ReadOnly);
     int state;
     QCString appId;
     QString talker;
     int seq;
     int sentenceCount;
     int partNum;
     int partCount;
     stream >> state;
     stream >> appId;
     stream >> talker;
     stream >> seq;
     stream >> sentenceCount;
     stream >> partNum;
     stream >> partCount;

virtual QString KSpeech::getTextJobNumbers ( ) [pure virtual]

Get a comma-separated list of text job numbers in the queue.

Returns:: Comma-separated list of text job numbers in the queue.

virtual QString KSpeech::getTextJobSentence	(	uint	jobNum = `0`,
		uint	seq = `0`
	)			`[pure virtual]`

Return a sentence of a job.

Parameters:

	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).
	seq	Sequence number of the sentence.

Returns:: The specified sentence in the specified job. If no such job or sentence, returns "".

virtual int KSpeech::getTextJobState ( uint jobNum = 0 ) [pure virtual]

Get the state of a text job.

Parameters:

	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

Returns:: State of the job. -1 if invalid job number.

See also:: kttsdJobState

virtual bool KSpeech::isSpeakingText ( ) const [pure virtual]

Determine if kttsd is currently speaking any text jobs.

Returns:: True if currently speaking any text jobs.

virtual int KSpeech::jumpToTextPart	(	int	partNum,
		uint	jobNum = `0`
	)			`[pure virtual]`

Jump to the first sentence of a specified part of a text job.

Parameters:

	partNum	Part number of the part to jump to. Parts are numbered starting at 1.
	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

Returns:: Part number of the part actually jumped to.

If partNum is greater than the number of parts in the job, jumps to last part. If partNum is 0, does nothing and returns the current part number. If no such job, does nothing and returns 0. Does not affect the current speaking/not-speaking state of the job.
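The return-value rules for jumpToTextPart can be summarised in a tiny standalone model. This is a sketch of the documented behaviour only; JobParts and modelJumpToTextPart are hypothetical names, a null pointer stands in for "no such job", and negative part numbers (which the description does not cover) are assumed here to behave like 0.

```cpp
#include <cassert>

// Hypothetical model of a text job's part position.
struct JobParts {
    int partCount;   // total number of parts in the job
    int currentPart; // parts are numbered starting at 1
};

// Model of the documented rules; a null job means "no such job".
int modelJumpToTextPart(JobParts *job, int partNum)
{
    if (!job)
        return 0;                  // no such job: do nothing
    if (partNum <= 0)
        return job->currentPart;   // 0 (and, by assumption, negatives): no move
    if (partNum > job->partCount)
        partNum = job->partCount;  // past the end: jump to the last part
    job->currentPart = partNum;
    return partNum;
}
```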

virtual void KSpeech::kttsdExit ( ) [pure virtual]

Stop the service.

void KSpeech::kttsdExiting ( )

This signal is emitted just before KTTSD exits.

void KSpeech::kttsdStarted ( )

This signal is emitted when KTTSD starts or restarts after a call to reinit.

void KSpeech::markerSeen	(	const QCString &	appId,
		const QString &	markerName
	)

This signal is emitted when the speech engine/plugin encounters a marker in the text.

Parameters:

	appId	DCOP application ID of the application that queued the text.
	markerName	The name of the marker seen.

See also:: markers

virtual uint KSpeech::moveRelTextSentence	(	int	n,
		uint	jobNum = `0`
	)			`[pure virtual]`

Advance or rewind N sentences in a text job.

Parameters:

	n	Number of sentences to advance (positive) or rewind (negative) in the job.
	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

Returns:: Sequence number of the sentence actually moved to. Sequence numbers are numbered starting at 1.

If no such job, does nothing and returns 0. If n is zero, returns the current sequence number of the job. Does not affect the current speaking/not-speaking state of the job.

virtual ASYNC KSpeech::moveTextLater ( uint jobNum = 0 ) [pure virtual]

Move a text job down in the queue so that it is spoken later.

Parameters:

	jobNum	Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

If the job is currently speaking, it is paused. If the next job in the queue is speakable, it begins speaking.

virtual ASYNC KSpeech::pauseText ( uint jobNum = 0 ) [pure virtual]

Pause a text job.

Parameters:

jobNum

Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

The job is marked as paused and will not be speakable until resumeText or startText is called.

If there are speaking jobs preceding this one in the queue, they continue speaking.

If the job is currently speaking, the textPaused signal is emitted and the job stops speaking. Note that if the next job in the queue is speakable, it does not start speaking as long as this job is paused.

Depending upon the speech engine and plugin used, speech may not stop immediately (it might finish the current sentence).
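
The way a paused job holds up the jobs behind it can be illustrated with a small queue model. This is a sketch under stated assumptions, not KTTSD's scheduler: QueuedJob and nextJobToSpeak are hypothetical names, the enum simply mirrors kttsdJobState, and skipping jobs in the queued or finished state is an assumption of this model.

```cpp
#include <vector>

// Values mirror the kttsdJobState enum of this interface.
enum JobState { jsQueued = 0, jsSpeakable = 1, jsSpeaking = 2, jsPaused = 3, jsFinished = 4 };

// Hypothetical queue entry; not a KTTSD type.
struct QueuedJob {
    unsigned jobNum;
    JobState state;
};

// Returns the number of the job that should be speaking, or 0 if none.
unsigned nextJobToSpeak(const std::vector<QueuedJob>& queue)
{
    for (const QueuedJob& job : queue) {
        if (job.state == jsSpeaking || job.state == jsSpeakable)
            return job.jobNum;   // first speakable job in queue order wins
        if (job.state == jsPaused)
            return 0;            // a paused job blocks the jobs behind it
        // ASSUMPTION: jsQueued and jsFinished jobs are simply skipped.
    }
    return 0;
}
```

In this model, a queue holding a paused job followed by a speakable job yields 0. Once the paused job is stopped instead (modelled here as jsQueued), the speakable job behind it is selected.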

See also:: resumeText

virtual void KSpeech::reinit ( ) [pure virtual]

Re-start KTTSD.

virtual ASYNC KSpeech::removeText ( uint jobNum = 0 ) [pure virtual]

Remove a text job from the queue.

Parameters:

jobNum

Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

The job is deleted from the queue and the textRemoved signal is emitted.

If there is another job in the text queue, and it is marked speakable, that job begins speaking.

virtual ASYNC KSpeech::resumeText ( uint jobNum = 0 ) [pure virtual]

Start or resume a text job where it was paused.

Parameters:

jobNum

Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

The job is marked speakable.

If the job is currently speaking, or is waiting to be spoken (speakable state), the resumeText() call is ignored.

If the job is currently queued, or is finished, it is the same as calling startText.

If there are speaking jobs preceding this one in the queue, those jobs continue speaking and when finished this job will begin speaking where it left off.

The textResumed signal is emitted when the job resumes.
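
The state rules for resumeText can be summarised as a transition function. The following is a minimal sketch, assuming the job states map onto kttsdJobState as described. ResumeResult and resumeTextModel are hypothetical names introduced for illustration only.

```cpp
// Values mirror the kttsdJobState enum of this interface.
enum JobState { jsQueued = 0, jsSpeakable = 1, jsSpeaking = 2, jsPaused = 3, jsFinished = 4 };

// Hypothetical result type: the new state, and whether the job was rewound.
struct ResumeResult {
    JobState state;
    bool rewound;
};

// Models the effect of resumeText on a job in the given state.
ResumeResult resumeTextModel(JobState current)
{
    switch (current) {
    case jsSpeaking:
    case jsSpeakable:
        return {current, false};      // call is ignored
    case jsQueued:
    case jsFinished:
        return {jsSpeakable, true};   // same as startText: rewind, mark speakable
    case jsPaused:
        return {jsSpeakable, false};  // continue where the job was paused
    }
    return {current, false};          // unreachable for valid states
}
```

Whether the job then actually starts speaking depends on the jobs ahead of it in the queue, which this function deliberately leaves out.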

See also:: pauseText

virtual ASYNC KSpeech::sayMessage	(	const QString &	message,
		const QString &	talker
	)			[pure virtual]

Say a message.

The message will be spoken when the current sentence stops speaking but after any warnings have been spoken. Messages should be used for one-shot messages that can't wait for normal text messages to stop speaking, such as "You have mail."

Parameters:

	message	The message to be spoken.
	talker	Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default talker. If no talker has been configured for the specified talker code, defaults to the closest matching talker.

virtual ASYNC KSpeech::sayScreenReaderOutput	(	const QString &	msg,
		const QString &	talker
	)			[pure virtual]

Say a message as soon as possible, interrupting any other speech in progress.

IMPORTANT: This method is reserved for use by Screen Readers and should not be used by any other applications.

Parameters:

	msg	The message to be spoken.
	talker	Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default talker. If no plugin has been configured for the specified Talker code, defaults to the closest matching talker.

If an existing Screen Reader output is in progress, it is stopped and discarded and replaced with this new message.

virtual uint KSpeech::sayText	(	const QString &	text,
		const QString &	talker
	)			[pure virtual]

Say a plain text job.

This is a convenience method that combines setText and startText into a single call.

Parameters:

	text	The message to be spoken.
	talker	Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default plugin. If no plugin has been configured for the specified Talker code, defaults to the closest matching talker.

Returns:: Job number.

Plain text is parsed into individual sentences using the current sentence delimiter. Call setSentenceDelimiter to change the sentence delimiter prior to calling sayText. Call getTextCount to retrieve the sentence count after calling sayText.

The text may contain speech markup language, such as Sable, JSML, or SSML, provided that the speech plugin/engine supports it. In this case, sentence parsing follows the semantics of the markup language.

The job is marked speakable. If there are other speakable jobs preceding this one in the queue, those jobs continue speaking and when finished, this job will begin speaking. If there are no other speakable jobs preceding this one, it begins speaking.

See also:: getTextCount

Since:: KDE 3.5

virtual ASYNC KSpeech::sayWarning	(	const QString &	warning,
		const QString &	talker
	)			[pure virtual]

Say a warning.

The warning will be spoken when the current sentence stops speaking and takes precedence over Messages and regular text. Warnings should only be used for high-priority messages requiring immediate user attention, such as "WARNING. CPU is overheating."

Parameters:

	warning	The warning to be spoken.
	talker	Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default talker. If no plugin has been configured for the specified Talker code, defaults to the closest matching talker.

void KSpeech::sentenceFinished	(	const QCString &	appId,
		uint	jobNum,
		uint	seq
	)

This signal is emitted when a sentence has finished speaking.

Parameters:

	appId	DCOP application ID of the application that queued the text.
	jobNum	Job number of the text job.
	seq	Sequence number of the text.

See also:: getTextCount

void KSpeech::sentenceStarted	(	const QCString &	appId,
		uint	jobNum,
		uint	seq
	)

This signal is emitted whenever a sentence begins speaking.

Parameters:

	appId	DCOP application ID of the application that queued the text.
	jobNum	Job number of the text job.
	seq	Sequence number of the text.

See also:: getTextCount

virtual uint KSpeech::setFile	(	const QString &	filename,
		const QString &	talker,
		const QString &	encoding
	)			[pure virtual]

Queue a text job from the contents of a file.

Does not start speaking the text.

Parameters:

	filename	Full path to the file to be spoken. May be a URL.
	talker	Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default talker. If no plugin has been configured for the specified Talker code, defaults to the closest matching talker.
	encoding	Name of the encoding to use when reading the file. If NULL or Empty, uses default stream encoding.

Returns:: Job number. 0 if an error occurs.

Plain text is parsed into individual sentences using the current sentence delimiter. Call setSentenceDelimiter to change the sentence delimiter prior to calling setFile. Call getTextCount to retrieve the sentence count after calling setFile.

The text may contain speech markup language, such as Sable, JSML, or SSML, provided that the speech plugin/engine supports it. In this case, sentence parsing follows the semantics of the markup language.

Call startText to mark the job as speakable and if the job is the first speakable job in the queue, speaking will begin.

See also:: getTextCount
startText

virtual ASYNC KSpeech::setSentenceDelimiter ( const QString & delimiter ) [pure virtual]

Sets the GREP pattern that will be used as the sentence delimiter.

Parameters:

delimiter

A valid GREP pattern.

The default sentence delimiter is

              ([\\.\\?\\!\\:\\;])(\\s|$|(\\n *\\n))

Note that backslashes must be escaped. When KTTSD parses the text, it replaces all tabs, spaces, and formfeeds with a single space, and then replaces the sentence delimiters using the following statement:

              QString::replace(sentenceDelimiter, "\\1\t");

which replaces all sentence delimiters with a tab while preserving the first capture text (first parenthesis). In other words, the sentence punctuation is preserved. The tab is later used to separate the text into sentences.

Changing the sentence delimiter does not affect other applications.
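
The collapse, replace, and split steps described above can be approximated outside of KTTSD. The sketch below uses std::regex as a stand-in for the Qt regular expression class, so it is not the daemon's actual code. splitSentences is a hypothetical helper, and its default pattern is the documented default written without the redundant escapes inside the character class, which is assumed to be equivalent.

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper that mimics the documented sentence parsing.
std::vector<std::string> splitSentences(
    const std::string& text,
    const std::string& delimiterPattern = R"re(([.?!:;])(\s|$|(\n *\n)))re")
{
    // Step 1: collapse runs of spaces, tabs, and formfeeds into one space.
    const std::regex blanks("[ \t\f]+");
    const std::string normalized = std::regex_replace(text, blanks, " ");

    // Step 2: replace each delimiter with its first capture plus a tab,
    // so the sentence punctuation is preserved.
    const std::regex delimiter(delimiterPattern);
    const std::string marked = std::regex_replace(normalized, delimiter, "$1\t");

    // Step 3: split on the tab characters inserted in step 2.
    std::vector<std::string> sentences;
    std::string current;
    for (char c : marked) {
        if (c == '\t') {
            if (!current.empty())
                sentences.push_back(current);
            current.clear();
        } else {
            current += c;
        }
    }
    if (!current.empty())
        sentences.push_back(current);
    return sentences;
}
```

Because step 1 removes every tab from the input, the tab is safe to use as the sentence separator in step 3. Passing a different pattern as the second argument corresponds to calling setSentenceDelimiter before queuing the text.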

See also:: sentenceparsing

virtual uint KSpeech::setText	(	const QString &	text,
		const QString &	talker
	)			[pure virtual]

Queue a text job.

Does not start speaking the text.

Parameters:

	text	The message to be spoken.
	talker	Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default plugin. If no plugin has been configured for the specified Talker code, defaults to the closest matching talker.

Returns:: Job number.

Plain text is parsed into individual sentences using the current sentence delimiter. Call setSentenceDelimiter to change the sentence delimiter prior to calling setText. Call getTextCount to retrieve the sentence count after calling setText.

The text may contain speech markup language, such as Sable, JSML, or SSML, provided that the speech plugin/engine supports it. In this case, sentence parsing follows the semantics of the markup language.

Call startText to mark the job as speakable and if the job is the first speakable job in the queue, speaking will begin.

See also:: getTextCount
startText

virtual void KSpeech::showDialog ( ) [pure virtual]

Displays the KTTS Manager dialog.

In this dialog, the user may back up or skip forward in any text job by sentence or part, rewind jobs, pause or resume jobs, or delete jobs.

virtual ASYNC KSpeech::speakClipboard ( ) [pure virtual]

Add the clipboard contents to the text queue and begin speaking it.

virtual ASYNC KSpeech::startText ( uint jobNum = 0 ) [pure virtual]

Start a text job at the beginning.

Parameters:

jobNum

Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

Rewinds the job to the beginning.

The job is marked speakable. If there are other speakable jobs preceding this one in the queue, those jobs continue speaking and when finished, this job will begin speaking. If there are no other speakable jobs preceding this one, it begins speaking.

The textStarted signal is emitted when the text job begins speaking. When all the sentences of the job have been spoken, the job is marked for deletion from the text queue and the textFinished signal is emitted.

virtual ASYNC KSpeech::stopText ( uint jobNum = 0 ) [pure virtual]

Stop a text job and rewind to the beginning.

Parameters:

jobNum

Job number of the text job. If zero, applies to the last job queued by the application, but if no such job, applies to the current job (if any).

The job is marked not speakable and will not be speakable until startText or resumeText is called.

If there are speaking jobs preceding this one in the queue, they continue speaking.

If the job is currently speaking, the textStopped signal is emitted, the job stops speaking, and if the next job in the queue is speakable, it begins speaking.

Depending upon the speech engine and plugin used, speech may not stop immediately (it might finish the current sentence).

virtual bool KSpeech::supportsMarkers ( const QString & talker ) const [pure virtual]

Determine whether the currently-configured speech plugin supports markers in speech markup.

Parameters:

talker

Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default talker.

Returns:: True if the plugin currently configured for the indicated talker supports markers.

virtual bool KSpeech::supportsMarkup	(	const QString &	talker,
		uint	markupType = 0
	)			const [pure virtual]

Determine whether the currently-configured speech plugin supports a speech markup language.

Parameters:

	talker	Code for the talker to do the speaking. Example "en". If NULL, defaults to the user's default talker.
	markupType	The kttsd code for the desired speech markup language.

Returns:: True if the plugin currently configured for the indicated talker supports the indicated speech markup language.

See also:: kttsdMarkupType

virtual QString KSpeech::talkerCodeToTalkerId ( const QString & talkerCode ) [pure virtual]

Given a Talker Code, returns the Talker ID of the talker that would speak a text job with that Talker Code.

Parameters:

talkerCode

Talker Code.

Returns:: Talker ID of the talker that would speak the text job.

void KSpeech::textAppended	(	const QCString &	appId,
		uint	jobNum,
		int	partNum
	)

This signal is emitted whenever a new part is appended to a text job.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.
	partNum	Part number of the new part. Parts are numbered starting at 1.

void KSpeech::textFinished	(	const QCString &	appId,
		uint	jobNum
	)

This signal is emitted whenever a text job is finished.

The job has been marked for deletion from the queue and will be deleted when another job reaches the Finished state. (Only one job in the text queue may be in state Finished at one time.) If startText or resumeText is called before the job is deleted, it will remain in the queue for speaking.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.
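
The rule that only one job may be in the Finished state at a time can be pictured with a small queue model. This is a sketch under stated assumptions rather than KTTSD's implementation: QueuedJob and markFinished are hypothetical names, and the enum mirrors kttsdJobState.

```cpp
#include <algorithm>
#include <vector>

// Values mirror the kttsdJobState enum of this interface.
enum JobState { jsQueued = 0, jsSpeakable = 1, jsSpeaking = 2, jsPaused = 3, jsFinished = 4 };

// Hypothetical queue entry; not a KTTSD type.
struct QueuedJob {
    unsigned jobNum;
    JobState state;
};

// Marks jobNum as finished. A job that was already finished is deleted
// first, so at most one finished job remains in the queue.
// Returns the number of the deleted job, or 0 if none was deleted.
unsigned markFinished(std::vector<QueuedJob>& queue, unsigned jobNum)
{
    unsigned removed = 0;
    auto old = std::find_if(queue.begin(), queue.end(),
        [jobNum](const QueuedJob& j) {
            return j.state == jsFinished && j.jobNum != jobNum;
        });
    if (old != queue.end()) {
        removed = old->jobNum;
        queue.erase(old);
    }
    for (QueuedJob& j : queue) {
        if (j.jobNum == jobNum)
            j.state = jsFinished;
    }
    return removed;
}
```

In this model the finished job stays available for startText or resumeText until the next job finishes, which matches the grace period described above.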

void KSpeech::textPaused	(	const QCString &	appId,
		uint	jobNum
	)

This signal is emitted whenever a speaking text job is paused.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.

void KSpeech::textRemoved	(	const QCString &	appId,
		uint	jobNum
	)

This signal is emitted whenever a text job is deleted from the queue.

The job is no longer in the queue when this signal is emitted.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.

void KSpeech::textResumed	(	const QCString &	appId,
		uint	jobNum
	)

This signal is emitted when a previously paused text job resumes speaking.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.

void KSpeech::textSet	(	const QCString &	appId,
		uint	jobNum
	)

This signal is emitted whenever a new text job is added to the queue.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.

void KSpeech::textStarted	(	const QCString &	appId,
		uint	jobNum
	)

This signal is emitted whenever speaking of a text job begins.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.

void KSpeech::textStopped	(	const QCString &	appId,
		uint	jobNum
	)

This signal is emitted whenever a speaking text job stops speaking.

Parameters:

	appId	The DCOP senderId of the application that created the job.
	jobNum	Job number of the text job.

The signal is only emitted if stopText() is called and the job is currently speaking.

virtual QString KSpeech::userDefaultTalker ( ) [pure virtual]

Get the user's default talker.

Returns:: A fully-specified talker code.

See also:: talkers
getTalkers

virtual QString KSpeech::version ( ) [pure virtual]

Return the KTTSD daemon version number.

Since:: KDE 3.5

Member Data Documentation

k_dcop_signals KSpeech::__pad0__

Definition at line 1183 of file kspeech.h.

The documentation for this interface was generated from the following file:

kspeech.h
