473,626 Members | 3,294 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

managing sound in a dotNet app

I am working on a foreign language learning program. (C# DotNet V2) I have
some written text, and a recording of a native speaker speaking that text. I
would like to be able to do the following:

1 - synchronize the voice with the text, so that the background color of
each word is hilighted as the voice reads that word.

2 - be able to jump around within the recorded text. For instance if the
user double-clicks a certain sentence - just that sentence is read. Currently
the speech is in one file per paragraph.

3 - be able to slow down or speed up the pace of the spoken text (while not
making it sound ridiculous), also the synchronization in step one should not
be broken by this process.

I am a rank amateur at audo, so any ideas on how to do this, what
sound-editing tools would be good to use, etc. will be good for your karma!
(and really appreciated by me.)
May 13 '06 #1
4 1493
To be right on sync, you will need some form of library that can actual
capture the audio. An cheaper option would be to get the timing of the words
and set up a metadata file for the audio (can be stored in a database or on
the file system (perhaps even in a resource file, athough that is not their
primary use)). When the clip hits a certain timing, highlight that word.

--
Gregory A. Beamer

*************** *************** *************** ****
Think Outside the Box!
*************** *************** *************** ****
"jdcharbonn eau" <jd***********@ discussions.mic rosoft.com> wrote in message
news:D5******** *************** ***********@mic rosoft.com...
I am working on a foreign language learning program. (C# DotNet V2) I have
some written text, and a recording of a native speaker speaking that text.
I
would like to be able to do the following:

1 - synchronize the voice with the text, so that the background color of
each word is hilighted as the voice reads that word.

2 - be able to jump around within the recorded text. For instance if the
user double-clicks a certain sentence - just that sentence is read.
Currently
the speech is in one file per paragraph.

3 - be able to slow down or speed up the pace of the spoken text (while
not
making it sound ridiculous), also the synchronization in step one should
not
be broken by this process.

I am a rank amateur at audo, so any ideas on how to do this, what
sound-editing tools would be good to use, etc. will be good for your
karma!
(and really appreciated by me.)

May 13 '06 #2
Thanks. The audio is pre-recorded, so an application to capture it isn't
needed. I do need an applicaiton that let's me edit the timing (probably by
hitting a key at the beginning of every new sentence, then every word).
Ideally this application would also store the 'metaData' (is there some sort
of standard for this?) Also it needs to be easy to edit mistakes.

Any ideas on where I can get a sound editor like this?

Thanks again

"Cowboy (Gregory A. Beamer)" wrote:
To be right on sync, you will need some form of library that can actual
capture the audio. An cheaper option would be to get the timing of the words
and set up a metadata file for the audio (can be stored in a database or on
the file system (perhaps even in a resource file, athough that is not their
primary use)). When the clip hits a certain timing, highlight that word.

--
Gregory A. Beamer

*************** *************** *************** ****
Think Outside the Box!
*************** *************** *************** ****
"jdcharbonn eau" <jd***********@ discussions.mic rosoft.com> wrote in message
news:D5******** *************** ***********@mic rosoft.com...
I am working on a foreign language learning program. (C# DotNet V2) I have
some written text, and a recording of a native speaker speaking that text.
I
would like to be able to do the following:

1 - synchronize the voice with the text, so that the background color of
each word is hilighted as the voice reads that word.

2 - be able to jump around within the recorded text. For instance if the
user double-clicks a certain sentence - just that sentence is read.
Currently
the speech is in one file per paragraph.

3 - be able to slow down or speed up the pace of the spoken text (while
not
making it sound ridiculous), also the synchronization in step one should
not
be broken by this process.

I am a rank amateur at audo, so any ideas on how to do this, what
sound-editing tools would be good to use, etc. will be good for your
karma!
(and really appreciated by me.)


May 13 '06 #3
Metadata is just the needed data in a format you want that tells you what you
want to know.

IE: for this application, maybe an XML file that you serialize upon startup
that lists the words to highlight at each point in time, so at 0.5 sec
highlight the second word, 1sec highlight the third, etc... Then a program
that you can write to just click the words that are on the screen as you
listen to the audio. So start the audio and record the datetime down to the
ms when you started it, then at each click record the difference in ms and
save that to an xml file. Now when you start the program, open the xml file
and it should show the words on the screen, and also have the hierarchy of
the word timing loaded into memory. Then just begin the audio and start your
highlighting.

A rough idea of what you're looking to do, but kind of gives you a starting
algorithm anyway.
--
KMG Software, Inc.
Thinking Beyond, Above, and Before
http://www.kmgsoftware.com
"jdcharbonn eau" wrote:
Thanks. The audio is pre-recorded, so an application to capture it isn't
needed. I do need an applicaiton that let's me edit the timing (probably by
hitting a key at the beginning of every new sentence, then every word).
Ideally this application would also store the 'metaData' (is there some sort
of standard for this?) Also it needs to be easy to edit mistakes.

Any ideas on where I can get a sound editor like this?

Thanks again

"Cowboy (Gregory A. Beamer)" wrote:
To be right on sync, you will need some form of library that can actual
capture the audio. An cheaper option would be to get the timing of the words
and set up a metadata file for the audio (can be stored in a database or on
the file system (perhaps even in a resource file, athough that is not their
primary use)). When the clip hits a certain timing, highlight that word.

--
Gregory A. Beamer

*************** *************** *************** ****
Think Outside the Box!
*************** *************** *************** ****
"jdcharbonn eau" <jd***********@ discussions.mic rosoft.com> wrote in message
news:D5******** *************** ***********@mic rosoft.com...
I am working on a foreign language learning program. (C# DotNet V2) I have
some written text, and a recording of a native speaker speaking that text.
I
would like to be able to do the following:

1 - synchronize the voice with the text, so that the background color of
each word is hilighted as the voice reads that word.

2 - be able to jump around within the recorded text. For instance if the
user double-clicks a certain sentence - just that sentence is read.
Currently
the speech is in one file per paragraph.

3 - be able to slow down or speed up the pace of the spoken text (while
not
making it sound ridiculous), also the synchronization in step one should
not
be broken by this process.

I am a rank amateur at audo, so any ideas on how to do this, what
sound-editing tools would be good to use, etc. will be good for your
karma!
(and really appreciated by me.)


May 15 '06 #4
Thank you very much. That might work, but it has a couple of problems:

1 - depending on the system load, CPU speed, load, etc. the amount of time
the recording takes to start playing varies from system to sytem and
different times on the same system. This makes syncing at startup difficult

2 - I'd like to be able to pause and continue - which will exagerate the
above problem.

3 - I'd like to be able to slow-down or speed up the recording.

For these reasons I was thinking of triggersthat would be embedded in the
actual sound file itself.
"Brandon @ KMG" wrote:
Metadata is just the needed data in a format you want that tells you what you
want to know.

IE: for this application, maybe an XML file that you serialize upon startup
that lists the words to highlight at each point in time, so at 0.5 sec
highlight the second word, 1sec highlight the third, etc... Then a program
that you can write to just click the words that are on the screen as you
listen to the audio. So start the audio and record the datetime down to the
ms when you started it, then at each click record the difference in ms and
save that to an xml file. Now when you start the program, open the xml file
and it should show the words on the screen, and also have the hierarchy of
the word timing loaded into memory. Then just begin the audio and start your
highlighting.

A rough idea of what you're looking to do, but kind of gives you a starting
algorithm anyway.
--
KMG Software, Inc.
Thinking Beyond, Above, and Before
http://www.kmgsoftware.com
"jdcharbonn eau" wrote:
Thanks. The audio is pre-recorded, so an application to capture it isn't
needed. I do need an applicaiton that let's me edit the timing (probably by
hitting a key at the beginning of every new sentence, then every word).
Ideally this application would also store the 'metaData' (is there some sort
of standard for this?) Also it needs to be easy to edit mistakes.

Any ideas on where I can get a sound editor like this?

Thanks again

"Cowboy (Gregory A. Beamer)" wrote:
To be right on sync, you will need some form of library that can actual
capture the audio. An cheaper option would be to get the timing of the words
and set up a metadata file for the audio (can be stored in a database or on
the file system (perhaps even in a resource file, athough that is not their
primary use)). When the clip hits a certain timing, highlight that word.

--
Gregory A. Beamer

*************** *************** *************** ****
Think Outside the Box!
*************** *************** *************** ****
"jdcharbonn eau" <jd***********@ discussions.mic rosoft.com> wrote in message
news:D5******** *************** ***********@mic rosoft.com...
>I am working on a foreign language learning program. (C# DotNet V2) I have
> some written text, and a recording of a native speaker speaking that text.
> I
> would like to be able to do the following:
>
> 1 - synchronize the voice with the text, so that the background color of
> each word is hilighted as the voice reads that word.
>
> 2 - be able to jump around within the recorded text. For instance if the
> user double-clicks a certain sentence - just that sentence is read.
> Currently
> the speech is in one file per paragraph.
>
> 3 - be able to slow down or speed up the pace of the spoken text (while
> not
> making it sound ridiculous), also the synchronization in step one should
> not
> be broken by this process.
>
> I am a rank amateur at audo, so any ideas on how to do this, what
> sound-editing tools would be good to use, etc. will be good for your
> karma!
> (and really appreciated by me.)

May 15 '06 #5

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

3
1286
by: Eric Caron | last post by:
Hi all, I have a property in one of my classes that represents an optional date. I'm trying to figure out if I can represent Null or Nothing as the value for a Date data type. Branching in the data access layer to send a Null to the stored procedure is no problem, but managing a Null date in the business object is another matter. I'm thinking of using a specific date (like 01-01-0001) to define the date as being empty/Null. Any...
5
3598
by: Mike Newman | last post by:
Does anyone know if it is possible to program the PC's sound card using Visual Studio.NET in basic? If so, how can I get some information? I just want to sound notes of a specified frequency either melodically in sequence or simultaneously in a chord. Thanks Mike
5
4684
by: Nick | last post by:
Hi there, Im trying to use the PlaySnd API to play a system sound, such as "Asterisk" or "Question" within the windows section. Anyone got any ideas on how this is done? I've tried using the SND_APPLICATION and SND_ALIAS flags but I'm thinking that my name isnt correct in the first place, should I be using "Windows\Asterisk"? Thanks in advance. -- Nick Pateman
6
2424
by: martin1 | last post by:
Hi, All, The app retrieve sound.wav file from my loacal C drive, is there anyway the sound file can be stored in the app? then when building, it goes with app rather than sending sound file seperate with app when deploying the app. Thanks
9
7701
by: Roger | last post by:
When I run the following code it does not play on the client. When the server and client are on the same machine it plays??? Any help from anyone??? Dim errorSound As New System.Media.SoundPlayer() errorSound.SoundLocation = ErrorLocation errorSound.Load() errorSound.Play()
8
3861
by: jer | last post by:
I've noticed you've helped a lot of people that were trying to do sound recording through VB.net. I'm searched all over the internet, and I'm still having problems write sound to a wav file. My code originate from a VB 6 application, and I'm trying to use vb 2005. I was having a lot of issues until I saw your post that cleaned up a lot of the structures and DLL calls in this: ...
3
1559
by: Nobody | last post by:
Hi all, It seems there's no way to record sound file with the 2.0 framework. Does anybody knows how to easily do it? Thanks in advance
2
260
by: Fine | last post by:
Hi Every one What I do when I click to run a program and produce sound?. In C++ I was using "Escape sequence a/" What I do in VB/ Can any person help? Thank you all ===========================
0
907
by: Marcolino | last post by:
Hi all, I need to write a little application in vb.net that can change the Default Audio device in windows, without access every time to control panel -Sound and Audio Device. Do you have any suggestion? Thanks
0
8196
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
0
8701
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. Here is my compilation command: g++-12 -std=c++20 -Wnarrowing bit_field.cpp Here is the code in...
1
8364
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
1
6122
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
4090
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
0
4196
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
1
2623
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
1
1807
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
2
1507
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.