jwz - text to speech? [entries|archive|friends|userinfo]
jwz

  www.jwz.org
  userinfo
  archive
  rss

Links
[»| DNA (Log) (iCal) WebCollage (LJ) Mixtapes ]

text to speech? [Mon, 11-Jul-2005 3:36 PM]
Previous Entry Add to Memories Tell a Friend Next Entry
[Tags|, , ]
[music |Skunk Anansie -- Cheap Honesty]

I know OSX has all this text-to-speech junk built in, but is there some simple way to take a text file and convert it to an audio file (WAV, AIFF, MP3, anything)?

Update: "say -f foo.txt -o foo.aiff". How about that!

linkReply

Comments:
[User Picture]From: [info]cvisors
Mon, 11-Jul-2005 3:40 PM (UTC)

(Link)

there is a command called say, which you can pipe stuff into, say -o foo.aiff will write the output to a file.

-b
[User Picture]From: [info]jwz
Mon, 11-Jul-2005 7:29 PM (UTC)

(Link)

Thank you. Thank you for actually answering the question, and thank you for not being like all these other cock-knockers below.
[User Picture]From: [info]strspn
Tue, 12-Jul-2005 3:12 AM (UTC)

(Link)

Comparison of question-answer fora:

Ask SlashdotGoogle AnswersUSENETjwz lazyweb
accuracy:fairgoodboth depend on phase of moon
SNR:lowhighrock-bottomnever good enough
advantages:fairly quicksemi-professionalworldwide guru coveragehighly entertaining
disadvantages:troll infestationpay to playinfinite spamtopics limited to current jwz frustration (could easily be more limited)
beware of:GNAAdifferences over what constitutes an answerNigeriansswift wrath
failure mode:if answer not evident in first hour, only posts mocking question get modded up; must read all posts, Newest First, to get real answersmoney squablescascades from regulars flaming off-topickernel recompilation or switch-back-to-linux imperatives
what you will learn:far more detail than you wanted to knowhow easily you could have found what you just paid $20 for if you had spent another few minutes lookingthat your question is either in the FAQ or a thread from last weekmulticultural sexuality, technogoth DJ tips, and how to run a nightclub
what you won't learn:whether the answer works without overclocking, a RC UAV case mod, and stolen winmodem driver sourcehow the geek who answered your $50 question in ten minutes found itthe canonical reference found in most libraries instead of the same answer in an obscure 1978 Ph.D. thesispatience
worthwhile for:often the top world expert provides the answertireless dedication of tens of thousands of food-stamp recipients working from their parents' basementtenured subject experts still reading with trn on 24x80 Zenith-19s (because office clutter doesn't allow terminal upgrades) not distracted by web pornangst-charged luser plonkings
respondents' motivations:damn editor bitchslapped dupe complaints last week, excellent karma status at riskmaking an easy bucknemisis on other continent might upstage by providing quicker answerone word: fanboys
question poster's motivation:don't know how to use Googledon't know how to use Ask Slashdottoo poor to use Google Answerscomputers hate jwz
50 years from now:OSDN purchased by TimeDisneyOL; text no longer permitted on /.Google purchased by Microsoft; only certified copies of IE for Win2055 Pro allowed to post questionsUSENET posts classified as "indymedia copyrestriction crime" -- only graft by spam harvesters allows it to continueGeneral advice on cleaning up messy spills to giant teledildonic service robot manequins
[User Picture]From: [info]zetawoof
Thu, 14-Jul-2005 1:27 PM (UTC)

(Link)

Your table is seriously breaking the layout. At least, I think I'm responding to [info]strspn's comment, which is currently below my comment box. I think you mislaid a row somewhere.
[User Picture]From: [info]strspn
Thu, 14-Jul-2005 6:13 PM (UTC)

(Link)

Sorry, I don't know why it did that. The HTML is perfect except for a missing slash before the final td tag. Still, that's perfectly legal and it shouldn't be messing everything else up.
[User Picture]From: [info]zetawoof
Thu, 14-Jul-2005 6:20 PM (UTC)

(Link)

On a second look, it's just screwing up its own comment box a bit. Never mind.
[User Picture]From: [info]jwz
Thu, 14-Jul-2005 6:22 PM (UTC)

(Link)

No, it has completely fucked the entire layout of the page if you look at it in Generator style.
[User Picture]From: [info]zetawoof
Thu, 14-Jul-2005 6:30 PM (UTC)

(Link)

In which case I stand correct in the first place. Looks almost correct in the default BML view, though. I wonder why the layout's failing so dramatically.
[User Picture]From: [info]grumpy_sysadmin
Mon, 11-Jul-2005 10:05 PM (UTC)

(Link)

Holy crap!

lazyweb works?!?

I may have to re-examine my whole outlook on life...
[User Picture]From: [info]edlang
Tue, 12-Jul-2005 1:22 AM (UTC)

small request

(Link)

I know how much you hate teh lunix, but as I don't have a Magical Mac could you please, if you've some idle time, upload samples of piping the text through the chef / B1FF / etc filters before they go through /usr/bin/say?

[User Picture]From: [info]the_p0pe
Mon, 11-Jul-2005 4:01 PM (UTC)

(Link)

I find it incredibly hard to believe that you haven't heard of festival before. It's a bit bloaty, but kinda neat. The suggestion to use "say" is better though.
[User Picture]From: [info]nugget
Mon, 11-Jul-2005 4:15 PM (UTC)

(Link)

Not wanting to use festival can be the result of not knowing about it...

Or just as likely, not wanting to use festival can be the result of knowing about it.

I think you presume too much. Festival isn't much easier than writing a Linux driver to control a Speak n' Spell, with similar quality results.
[User Picture]From: [info]the_p0pe
Mon, 11-Jul-2005 4:26 PM (UTC)

(Link)

Or just as likely, not wanting to use festival can be the result of knowing about it.

Bingo
[User Picture]From: [info]violentbloom
Mon, 11-Jul-2005 4:43 PM (UTC)

(Link)

all the good voices are on windows :( which sucks on a number of levels.
[User Picture]From: [info]spike
Mon, 11-Jul-2005 5:02 PM (UTC)

(Link)

There are some great (commercial, $30ea) voices for Mac OS X's native text-to-speech synth available from Cepstral. If you're even a moderate user of TTS, they can be a fun addition to the standard set. And if anyone else hears your TTS audio, using one of the Cepstral voices sets it apart from the usual choices.

Now about getting those National Weather Service voices...
[User Picture]From: [info]curious_au
Mon, 11-Jul-2005 5:11 PM (UTC)

(Link)

Thirty dollars? I can get one of my students to sit under the desk and read it off an LCD for that price!
From: [info]err0neous
Mon, 11-Jul-2005 6:12 PM (UTC)

WTF

(Link)

WTF is wrong with you people?? The first god damn post answered the question concisely and correctly. I don't understand why you all feel the need to continue to go on about garbage like festival (you morons think that compiling some software is somehow easier than 'say?') and other shit that you have to pay for!

I suppose you are all probably just doing it on purpose to piss off jwz.
[User Picture]From: [info]fantasygoat
Mon, 11-Jul-2005 6:40 PM (UTC)

Re: WTF

(Link)

Hmm, now that you mention it.
[User Picture]From: [info]spike
Mon, 11-Jul-2005 7:12 PM (UTC)

(Link)

Um, the Cepstral voices work with the built-in Mac OS X text-to-speech engine. They are plug-ins for it. They extend it. They make it work better. In this case the "shit" that you "have to pay for" is completely optional.
[User Picture]From: [info]spike
Mon, 11-Jul-2005 7:15 PM (UTC)

(Link)

Thirty dollars? I can get one of my students to sit under the desk and read it off an LCD for that price!

If I'm going to pay someone $30 to get under my desk, "reading text off an LCD" might not be the first thing I'd want them to do, but maybe that's just me.

I wonder if there's a "talking with your mouth full" voice module out there somewhere?
[User Picture]From: [info]ziggy_san
Tue, 9-Aug-2005 2:45 PM (UTC)

(Link)

If I'm going to pay someone $30 to get under my desk, "reading text off an LCD" might not be the first thing I'd want them to do, but maybe that's just me.

I wonder if there's a "talking with your mouth full" voice module out there somewhere?


I laughed. Giggled, even.
[User Picture]From: [info]jwz
Mon, 11-Jul-2005 7:26 PM (UTC)

(Link)

You're a dumbass.
[User Picture]From: [info]ziggy_san
Tue, 9-Aug-2005 2:40 PM (UTC)

(Link)

Dude, holy crap...

Either I'm a retard when it comes to computers, or that festival program is really freakin' hard to install. Compiling? C++? ARGH! Plus, there's like 23 different files to download and I have no clue which ones I need!

Yeah, I guess I could Google for that C++ compiler thing but...isn't there anything out there that's a tad easier to install?
[User Picture]From: [info]toucansam616
Mon, 11-Jul-2005 5:27 PM (UTC)

(Link)

who are you?

i searched for crack monkey on google and found this
[User Picture]From: [info]gargargar
Mon, 11-Jul-2005 6:47 PM (UTC)

Oh dear.

(Link)

It's official then, I now hate google.
[User Picture]From: [info]jwz
Mon, 11-Jul-2005 7:25 PM (UTC)

(Link)

I'm your mom, and I should have sold you when I had the chance.
[User Picture]From: [info]jerub
Mon, 11-Jul-2005 10:41 PM (UTC)

(Link)

automator makes it easy. you can set up automator to allow you to drag a text file onto a workflow that will create an audio file. You even get a choice of what voice to use.

Assuming you have 10.4 of course.
[User Picture]From: [info]jwz
Mon, 11-Jul-2005 10:50 PM (UTC)

(Link)

Wow, that sounds so much easier than using /usr/bin/say!

But hey thanks for reading before posting. You know how that really makes my day.
From: [info]wsxyz
Tue, 12-Jul-2005 7:18 AM (UTC)

what?

(Link)

What does "drag a text file onto a workflow" mean anyway?
[User Picture]From: [info]airmax
Tue, 12-Jul-2005 1:03 PM (UTC)

(Link)

Once upon a veery long time ago there was such nifty tool for DOS. But that looks more like one of useless facts I have.