sumom****@users*****
Thu Oct 2 17:23:45 JST 2008
Index: julius4/adintool/00readme-ja.txt
diff -u julius4/adintool/00readme-ja.txt:1.2 julius4/adintool/00readme-ja.txt:1.3
--- julius4/adintool/00readme-ja.txt:1.2  Tue Dec 18 23:08:21 2007
+++ julius4/adintool/00readme-ja.txt  Thu Oct 2 17:23:45 2008
@@ -1,206 +1,214 @@
[Hunk body: the Japanese text is mis-encoded in the archived mail and is not
recoverable. The surviving ASCII fragments show these changes:
  * NAME and SYNOPSIS headings replaced by Japanese headings; the synopsis
    becomes "adintool {-in inputdev} {-out outputdev} [options...]".
  * INPUT and OUTPUT sections removed; their content moves under the
    -in and -out options.
  * OPTIONS rewritten as "adintool specific options" (-freq, -in, -out,
    -inport, -server, -port, -filename, -startid, -oneshot, -nosegment,
    -raw, -autopause, -loosesync, -rewind) and "Concerning Julius options"
    (-input, -lv, -zc, -headmargin, -tailmargin, -zmean, -smpFreq, -48,
    -NA, -adport, -nostrip, -C, -plugindir).
  * -headmargin default changes from 400 to 300; -tailmargin stays at 400.
  * ENVIRONMENT VARIABLES section added: ALSADEV, AUDIODEV, LATENCY_MSEC.
  * EXAMPLE renamed EXAMPLES; the command lines are dropped.
  * BUGS and AUTHORS sections removed; COPYRIGHT years extended to 2008 and
    a 1997-2000 IPA line added.
  * Footer "4.3 Berkeley Distribution LOCAL" replaced by "10/02/2008".]
Index: julius4/adintool/00readme.txt
diff -u julius4/adintool/00readme.txt:1.2 julius4/adintool/00readme.txt:1.3
--- julius4/adintool/00readme.txt:1.2  Tue Dec 18 23:08:21 2007
+++ julius4/adintool/00readme.txt  Thu Oct 2 17:23:45 2008
@@ -1,206 +1,214 @@
+    adintool
+
 ADINTOOL(1)                                                        ADINTOOL(1)

 NAME
-       adintool - audio tool to record/split/send/receive speech data for
-       Julius
+           adintool
+          - a tool to record / split / send / receive audio streams

 SYNOPSIS
-       adintool -in inputdev -out outputdev [options...]
+       adintool {-in inputdev} {-out outputdev} [options...]

 DESCRIPTION
        adintool analyzes speech input, finds speech segments skipping silence,
-       and records the detected segments in various ways. It performs speech
-       detection based on zerocross number and power (level), and records the
+       and records the detected segments in various ways. It performs speech
+       detection based on zerocross number and power (level), and records the
        detected parts to files or other output devices sucessively.

-       adintool is a highly-functioned version of adinrec. The supported
-       input device are: microphone input, a speech file, standard tty input,
-       and network socket (called adin-net server mode). The speech segments
-       are saved to output devices: speech files, standard tty output, and
-       network socket (called adin-net client mode). For example, you can
-       record the incoming speech segments to files with successively-numbered
-       suffixes, or send them to speech recognition engine julius to recognize
-       them.
-
-       The output is not buffered: the receiver can get speech data with only
-       a slight delay after a speech starts. The speech detection algorithm
-       is as same as that of adinrec.
-
-       Output format is WAV, 16bit (signed short), monoral. If the file
-       already exist, it will be overridden.
-
-INPUT
-       The input device should be specified by one of the following options:
-
-       -in mic
-              Microphone input (default)
-       -in file
-              Speech data file. Supported format is RAW (16bit big endian)
-              and WAV (no compression) etc (depending on the compilation time
-              setting).
-              The input file name should be given later: prompt will appear
-              after startup.
-
-       -in adinnet
-              Make adintool "adinnet server", waiting for connection from
-              adinnet client and receiving speech data from there via tcp/ip
-              socket.
-              Default port number is 5530, which can be altered by option
-              "-port".
-
-       -in netaudio
-              If supported, get input data from NetAudio/Datlink server. Host
-              and unit name should be given with "-NA host:unit" option.
-
-       -in stdin
-              Read speech data from standard tty input. Only RAW and WAV
-              format is supported.
-
-OUTPUT
-       Specify one of these below to select an output device which the
-       detected speech segments are going to written to.
-
-       -out file
-              Output to files. The base filename should be given by option
-              like "-filename foobar". Actually, the detected segments are
-              recorded in separate files such as "foobar.0000", "foobar.0001"
-              and so on. The four-digit ID begin with 0. This initial value
-              can be set explicitly by option "-startid". The output format
-              is WAV, 16bit signed. This can be changed by "-raw" option.
-
-       -out adinnet
-              Make adintool "adinnet client", making connection to an adinnet
-              server on a host, and send speech data to the server. The
-              hostname should be specified by option "-server". The default port
-              number is 5530, which can be altered by option "-port". The
-              available adinnet server so far is adintool and Julius.
+       adintool is a upper version of adinrec with various functions.
+       Supported input device are: microphone input, a speech file, standard
+       tty input, and network socket (called adin-net server mode). Julius
+       plugin can be also used. Detected speech segments will be saved to
+       output devices: speech files, standard tty output, and network socket
+       (called adin-net client mode). For example, you can split the incoming
+       speech to segments and send them to Julius to be recognized.

-       -out stdout
-              Output to standard tty output in RAW, 16bit signed (big endian).
+       Output format is WAV, 16bit (signed short), monoral. If the file
+       already exist, it will be overridden.

 OPTIONS
-       -server host[,host...]
-              Server(s) to connect with "-out adinnet". With multiple server,
-              port number for each host should be specified by comma-separated
-              list. (default: 5530)
-
-       -port num[,host...]
-              Port number to connect with "-out adinnet". Should be
-              correspond with "-server"
-
-       -nosegment
-              Re-direct whole input speech data to output device, without
-              speech detection and segmentation. With this option, the output
-              filename does not have its four-digit ID appended.
-
-       -oneshot
-              Record only the first speech segment.
-
-       -freq threshold
-              Sampling frequency (Hz, default=16000)
-
-       -48    Record in 48kHz, and down sampling to 16kHz.
-
-       -lv threslevel
-              Level threshold (0-32767, default=2000)
-
-       -zc zerocrossnum
-              Zero cross number threshold in a second (default=60)
-
-       -headmargin msec
-              Header margin of each speech segment (unit: milliseconds)
-              (default: 400)
-
-       -tailmargin msec
-              Tail margin of each speech segment (unit: milliseconds)
-              (default: 400)
-
-       -nostrip
-              Disable skipping of invalid zero samples (default: enabled)
-
-       -zmean Enable zero mean subtraction to remove DC offset.
-
-       -raw   Output in RAW (no header) 16bit, big engian format (default:
-              WAV)
-
-       -autopause
-              Automatically pause at each input end.
-
-       -loosesync
-              When connecting to multiple servers, avoid strict synchronization
-              for server-side pause and resume command.
-
-       -rewind msec
-              By default, adintool will ignore speech input while being paused
-              by server-side command. This may be a problem if an input
-              begins while paused and then adintool resumes before the input
-              ends. This option will send the last msec inputs before resuming.
-
-EXAMPLE
-       Record microphone input only for the speech-detected part in
-       "data.0000.wav", "data.0001.wav", ...:
-
-           % adintool -in mic -out file -filename data
-
-       Split a large speech data "foobar.raw" to "foobar.1500.wav",
-       "foobar.1501.wav", etc:
-
-           % adintool -in file -out file -filename foobar
-             -startid 1500
-             (enter the input filename after startup)
-             enter filename->foobar.raw
-             ....
-
-       Send whole speech file to other host via tcp/ip socket:
-
-         [sender]
-           % adintool -in adinnet -out file -nosegment
-         [receiver]
-           % adintool -in file -out adinnet -server hostname
-             -nosegment
-
-       Send microphone input to Julius running on other host:
-
-       (1) Transmit whole input, and let Julius execute
-           speech detection and recognition:
-
-         [Julius]
-           % julius -C xxx.jconf ... -input adinnet
-         [adintool]
-           % adintool -in mic -out adinnet -server hostname
-             -nosegment
-
-       (2) Detect speech segment at input client side
-           (adintool), and transmit only the detected parts
-           to Julius, and recognize them:
-
-         [Julius]
-           % julius -C xxx.jconf ... -input adinnet
-         [adintool]
-           % adintool -in mic -out adinnet -server hostname
+       All Julius options can be set. Only audio input related options are
+       treated and others are silently skipped. Below is a list of options.
+   adintool specific options
+       -freq Hz
+           Set sampling rate in Hz. (default: 16,000)
+
+       -in inputdev
+           Audio input device. "mic" to capture via microphone input, "file"
+           for audio file input, and "stdin" to read raw data from
+           standard-input. For file input, file name prompt will appear after
+           startup. Use "adinnet" to make adintool as "adinnet server",
+           receiving data from client via network socket. Default port number
+           is 5530, which can be altered by option "-inport".
+
+           Alternatively, input device can be set by "-input" option, in which
+           case you can use plugin input.
+
+       -out outputdev
+           Audio output device store the data. Specify "file" to save to file,
+           in which the output filename should be given by "-filename". Use
+           "stdout" to standard out. "adinnet" will make adintool to be an
+           adinnet client, sending speech data to a server via tcp/ip socket.
+           When using "adinnet" output, the server name to send data should be
+           specified by "-server". The default port number is 5530, which can
+           be changed by "-port" option.
+
+       -inport num
+           When adintool becomes adinnet server to receive data (-in adinnet),
+           set the port number to listen. (default: 5530)
+
+       -server [host] [,host...]
+           When output to adinnet server (-out adinnet), set the hostname. You
+           can send to multiple hosts by specifying their hostnames as
+           comma-delimited list like "host1,host2,host3".
+
+       -port [num] [,num...]
+           When adintool send a data to adinnet server (-out adinnet), set the
+           port number to connect. (default: 5530) For multiple servers,
+           specify port numbers for all servers like "5530,5530,5531".
+
+       -filename file
+           When output to file (-out file), set the output filename. The actual
+           file name will be as "file.0000.wav" , "file.0001.wav" and so on,
+           where the four digit number increases as speech segment detected.
+           The initial number will be set to 0 by default, which can be changed
+           by "-startid" option. When using "-oneshot" option to save only the
+           first segment, the input will be saved as "file".
+
+       -startid number
+           At file output, set the initial file number. (default: 0)
+
+       -oneshot
+           Exit after the end of first speech segment.
+
+       -nosegment
+           Do not perform speech detection for input, just treat all the input
+           as a single valid segment.
+
+       -raw
+           Output as RAW file (no header).
+
+       -autopause
+           When output to adinnet server, adintool enter pause state at every
+           end of speech segment. It will restart when the destination adinnet
+           server sends it a resume signal.
+
+       -loosesync
+           When output to multiple adinnet server, not to do strict
+           synchronization for restart. By default, when adintool has entered
+           pause state, it will not restart until resume commands are received
+           from all servers. This option will allow restart at least one
+           restart command has arrived.
+
+       -rewind msec
+           When input is a live microphone device, and there has been some
+           continuing input at the moment adintool resumes, it start recording
+           backtracking by the specified milliseconds.
+
+   Concerning Julius options
+       -input {mic|rawfile|adinnet|stdin|netaudio|esd|alsa|oss}
+           Choose speech input source. Specify 'file' or 'rawfile' for waveform
+           file. On file input, users will be prompted to enter the file name
+           from stdin.
+
+           'mic' is to get audio input from a default live microphone device,
+           and 'adinnet' means receiving waveform data via tcpip network from
+           an adinnet client. 'netaudio' is from DatLink/NetAudio input, and
+           'stdin' means data input from standard input.
+
+           At Linux, you can choose API at run time by specifying alsa, oss and
+           esd.
+
+       -lv thres
+           Level threshold for speech input detection. Values should be in
+           range from 0 to 32767. (default: 2000)
+
+       -zc thres
+           Zero crossing threshold per second. Only input that goes over the
+           level threshold (-lv) will be counted. (default: 60)
+
+       -headmargin msec
+           Silence margin at the start of speech segment in milliseconds.
+           (default: 300)
+
+       -tailmargin msec
+           Silence margin at the end of speech segment in milliseconds.
+           (default: 400)
+
+       -zmean
+           This option enables DC offset removal.
+
+       -smpFreq Hz
+           Set sampling rate in Hz. (default: 16,000)
+
+       -48
+           Record input with 48kHz sampling, and down-sample it to 16kHz
+           on-the-fly. This option is valid for 16kHz model only. The
+           down-sampling routine was ported from sptk. (Rev. 4.0)
+
+       -NA devicename
+           Host name for DatLink server input (-input netaudio).
+
+       -adport port_number
+           With -input adinnet, specify adinnet port number to listen.
+           (default: 5530)
+
+       -nostrip
+           Julius by default removes successive zero samples in input speech
+           data. This option stop it.
+
+       -C jconffile
+           Load a jconf file at here. The content of the jconffile will be
+           expanded at this point.
+
+       -plugindir dirlist
+           Specify which directories to load plugin. If several direcotries
+           exist, specify them by colon-separated list.
+
+ENVIRONMENT VARIABLES
+       ALSADEV
+           (using mic input with alsa device) specify a capture device name. If
+           not specified, "default" will be used.
+
+       AUDIODEV
+           (using mic input with oss device) specify a capture device path. If
+           not specified, "/dev/dsp" will be used.
+
+       LATENCY_MSEC
+           Try to set input latency of microphone input in milliseconds.
+           Smaller value will shorten latency but sometimes make process
+           unstable. Default value will depend on the running OS.
+
+EXAMPLES
+       Record microphone input to files: "data.0000.wav", "data.0001.wav" and
+       so on:
+       Split a long speech file "foobar.raw" into "foobar.1500.wav",
+       "foobar.1501.wav" ...:
+       Copy an entire audio file via network socket.
+       Detect speech segment, send to Julius via network and recognize it:

 SEE ALSO
-       julius(1), adinrec(1)
+       julius ( 1 ) ,
+       adinrec ( 1 )

 COPYRIGHT
-       Copyright (c) 1991-2007 Kawahara Lab., Kyoto University
-       Copyright (c) 2001-2007 Shikano Lab., Nara Institute of Science and
+       Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan
+
+       Copyright (c) 1991-2008 Kawahara Lab., Kyoto University
+
+       Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and
        Technology
-       Copyright (c) 2005-2007 Julius project team, Nagoya Institute of
-       Technology

-AUTHORS
-       LEE Akinobu (Nagoya Institute of Technology, Japan)
-       contact: juliu****@lists*****
+       Copyright (c) 2005-2008 Julius project team, Nagoya Institute of
+       Technology

 LICENSE
-       Same as Julius.
+       The same as Julius.

-4.3 Berkeley Distribution            LOCAL                        ADINTOOL(1)
+                                  10/02/2008                       ADINTOOL(1)
Index: julius4/adintool/adintool.man
diff -u julius4/adintool/adintool.man:1.2 julius4/adintool/adintool.man:removed
--- julius4/adintool/adintool.man:1.2  Tue Dec 18 23:08:21 2007
+++ julius4/adintool/adintool.man  Thu Oct 2 17:23:45 2008
@@ -1,193 +0,0 @@
-.de Sp
-.if t .sp .5v
-.if n .sp
-..
-.de Ip
-.br
-.ie \\n.$>=3 .ne \\$3
-.el .ne 3
-.IP "\\$1" \\$2
-..
-.TH ADINTOOL 1 LOCAL
-.UC 6
-.SH NAME
-adintool - audio tool to record/split/send/receive speech data for Julius
-.SH SYNOPSIS
-.B adintool -in inputdev -out outputdev [options...]
-.SH DESCRIPTION
-.I adintool
-analyzes speech input, finds speech segments skipping silence, and
-records the detected segments in various ways. It performs speech
-detection based on zerocross number and power (level), and records the
-detected parts to files or other output devices sucessively.
-.PP
-.I adintool
-is a highly-functioned version of
-.I adinrec.
-The supported input device are: microphone input, a speech file,
-standard tty input, and network socket (called adin-net server mode).
-The speech segments are saved to output devices: speech files,
-standard tty output, and network socket (called adin-net client mode).
-For example, you can record the incoming speech segments to files with
-successively-numbered suffixes, or send them to speech recognition
-engine
-.I julius
-to recognize them.
-.PP
-The output is not buffered: the receiver can get speech data with only
-a slight delay after a speech starts. The speech detection algorithm
-is as same as that of
-.I adinrec.
-.PP
-Output format is WAV, 16bit (signed short), monoral.
-If the file already exist, it will be overridden.
-.SH INPUT
-The input device should be specified by one of the following options:
-.Ip "\-in mic"
-Microphone input (default)
-.Ip "\-in file"
-Speech data file. Supported format is RAW (16bit big endian) and WAV
-(no compression) etc (depending on the compilation time setting).
-.br
-The input file name should be given later: prompt will appear after startup.
-.Ip "\-in adinnet"
-Make
-.I adintool
-"adinnet server", waiting for connection from adinnet client and
-receiving speech data from there via tcp/ip socket.
-.br
-Default port number is 5530, which can be altered by option "-port".
-.Ip "\-in netaudio"
-If supported, get input data from NetAudio/Datlink server. Host and
-unit name should be given with "-NA host:unit" option.
-.Ip "\-in stdin"
-Read speech data from standard tty input. Only RAW and WAV format is
-supported.
-.SH OUTPUT
-Specify one of these below to select an output device which the detected
-speech segments are going to written to.
-.Ip "\-out file"
-Output to files. The base filename should be given by option like
-"-filename foobar". Actually, the detected segments are recorded in
-separate files such as "foobar.0000", "foobar.0001" and so on. The
-four-digit ID begin with 0. This initial value can be set explicitly
-by option "-startid". The output format is WAV, 16bit signed. This
-can be changed by "-raw" option.
-.Ip "\-out adinnet"
-Make
-.I adintool
-"adinnet client", making connection to an adinnet server on a host,
-and send speech data to the server. The hostname should be specified
-by option "-server". The default port number is 5530, which can be
-altered by option "-port".
-The available adinnet server so far is
-.I adintool
-and
-.I Julius.
-.Ip "\-out stdout"
-Output to standard tty output in RAW, 16bit signed (big endian).
-.SH OPTIONS
-.Ip "\-server host[,host...]"
-Server(s) to connect with "-out adinnet". With multiple server, port
-number for each host should be specified by comma-separated
-list. (default: 5530)
-.Ip "\-port num[,host...]"
-Port number to connect with "-out adinnet". Should be correspond with "-server"
-.Ip "\-nosegment"
-Re-direct whole input speech data to output device, without speech
-detection and segmentation. With this option, the output filename
-does not have its four-digit ID appended.
-.Ip "\-oneshot"
-Record only the first speech segment.
-.Ip "\-freq threshold"
-Sampling frequency (Hz, default=16000)
-.Ip "\-48"
-Record in 48kHz, and down sampling to 16kHz.
-.Ip "\-lv threslevel"
-Level threshold (0-32767, default=2000)
-.Ip "\-zc zerocrossnum"
-Zero cross number threshold in a second (default=60)
-.Ip "\-headmargin msec"
-Header margin of each speech segment (unit: milliseconds)
-(default: 400)
-.Ip "\-tailmargin msec"
-Tail margin of each speech segment (unit: milliseconds)
-(default: 400)
-.Ip "\-nostrip"
-Disable skipping of invalid zero samples (default: enabled)
-.Ip "\-zmean"
-Enable zero mean subtraction to remove DC offset.
-.Ip "\-raw"
-Output in RAW (no header) 16bit, big engian format (default: WAV)
-.Ip "\-autopause"
-Automatically pause at each input end.
-.Ip "\-loosesync"
-When connecting to multiple servers, avoid strict synchronization
-for server-side pause and resume command.
-.Ip "\-rewind msec"
-By default, adintool will ignore speech input while being paused by
-server-side command. This may be a problem if an input begins while
-paused and then adintool resumes before the input ends. This option
-will send the last msec inputs before resuming.
-.SH "EXAMPLE"
-Record microphone input only for the speech-detected part in
-"data.0000.wav", "data.0001.wav", ...:
-.PP
-    % adintool -in mic -out file -filename data
-.PP
-Split a large speech data "foobar.raw" to "foobar.1500.wav", "foobar.1501.wav", etc:
-.PP
-    % adintool -in file -out file -filename foobar
-      -startid 1500
-      (enter the input filename after startup)
-      enter filename->foobar.raw
-      ....
-.PP
-Send whole speech file to other host via tcp/ip socket:
-.PP
-  [sender]
-    % adintool -in adinnet -out file -nosegment
-.BR
-  [receiver]
-    % adintool -in file -out adinnet -server hostname
-      -nosegment
-.PP
-Send microphone input to
-.I Julius
-running on other host:
-.PP
-(1) Transmit whole input, and let Julius execute
-    speech detection and recognition:
-.PP
-  [Julius]
-    % julius -C xxx.jconf ... -input adinnet
-.BR
-  [adintool]
-    % adintool -in mic -out adinnet -server hostname
-      -nosegment
-.PP
-(2) Detect speech segment at input client side
-    (adintool), and transmit only the detected parts
-    to Julius, and recognize them:
-.PP
-  [Julius]
-    % julius -C xxx.jconf ... -input adinnet
-.BR
-  [adintool]
-    % adintool -in mic -out adinnet -server hostname
-
-.SH "SEE ALSO"
-julius(1), adinrec(1)
-.SH COPYRIGHT
-Copyright (c) 1991-2007 Kawahara Lab., Kyoto University
-.br
-Copyright (c) 2001-2007 Shikano Lab., Nara Institute of Science and Technology
-.br
-Copyright (c) 2005-2007 Julius project team, Nagoya Institute of Technology
-.SH AUTHORS
-LEE Akinobu (Nagoya Institute of Technology, Japan)
-.br
-contact: juliu****@lists*****
-.SH LICENSE
-Same as
-.I Julius.
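The -lv and -zc descriptions in the readme text above say that speech is detected from a level threshold and a per-second zero-crossing threshold, and that only input exceeding the level is counted. A minimal Python sketch of that idea follows. It is an illustration under assumptions, not the adintool or Julius implementation: the function names, the sign-change counting rule, and the synthetic test tone are all hypothetical, and the margins (-headmargin, -tailmargin) are not modelled.

```python
import math


def count_level_crossings(samples, level):
    """Count sign changes between samples whose magnitude reaches `level`."""
    crossings = 0
    last_sign = 0  # stays 0 until the first sample that reaches the level
    for s in samples:
        if abs(s) < level:
            continue  # quieter than the level threshold: not counted
        sign = 1 if s > 0 else -1
        if last_sign and sign != last_sign:
            crossings += 1
        last_sign = sign
    return crossings


def looks_like_speech(samples, sample_rate=16000, level=2000, zc_per_sec=60):
    """Return True if the level-gated crossing rate reaches the threshold.

    The defaults mirror the documented defaults of -freq, -lv and -zc;
    the decision rule itself is an assumption made for this sketch.
    """
    if not samples:
        return False
    seconds = len(samples) / sample_rate
    return count_level_crossings(samples, level) / seconds >= zc_per_sec


def sine(freq_hz, amplitude, sample_rate=16000, seconds=0.1):
    """Synthetic test tone with integer samples (hypothetical helper)."""
    n = int(sample_rate * seconds)
    return [int(amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate))
            for i in range(n)]


loud = sine(200, 8000)   # peaks well above the default level of 2000
quiet = sine(200, 500)   # never reaches the level threshold
print(looks_like_speech(loud), looks_like_speech(quiet))
```

In this sketch the quiet tone crosses zero just as often as the loud one, but none of its samples reach the level threshold, so nothing is counted and it is treated as silence.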
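Index: julius4/adintool/adintool.man.ja
diff -u julius4/adintool/adintool.man.ja:1.2 julius4/adintool/adintool.man.ja:removed
--- julius4/adintool/adintool.man.ja:1.2  Tue Dec 18 23:08:21 2007
+++ julius4/adintool/adintool.man.ja  Thu Oct 2 17:23:45 2008
@@ -1,193 +0,0 @@
-.de Sp
-.if t .sp .5v
-.if n .sp
-..
-.de Ip
-.br
-.ie \\n.$>=3 .ne \\$3
-.el .ne 3
-.IP "\\$1" \\$2
-..
-.TH ADINTOOL 1 LOCAL
-.UC 6
-.SH NAME
-adintool - audio tool to record/split/send/receive speech data for Julius
-.SH SYNOPSIS
-.B adintool -in inputdev -out outputdev [options...]
-.SH DESCRIPTION
-.I adintool
-is a tool that continuously detects and records speech segments in audio waveform data.
-It successively detects speech segments in the input, based on the zero-cross count and amplitude level,
-and outputs the speech segments one after another.
-.PP
-.I adintool
-is a higher-functionality version of
-.I adinrec.
-As the input source you can select microphone input, an audio waveform file,
-standard input, or network input (adinnet server mode).
-As the output destination you can select an audio waveform file, standard output,
-or network output (adinnet client mode). In particular, with network output
-(adinnet client mode) you can send speech over the network to
-.I julius
-and have it recognized.
-.PP
-Input speech is automatically split into speech segments and output successively.
-Segments are cut out, as in
-.I adinrec,
-using thresholds on the zero-cross count within a fixed time and on power (amplitude level).
-Output starts as soon as a speech segment starts. When file output is selected,
-each detected segment is saved under a sequentially numbered file name.
-.PP
-The sampling frequency can be set freely. The format is 16bit mono. The data is written as WAV, 16bit, mono.
-If a file with the same name already exists, it is overwritten.
-.SH INPUT
-Specify one of the following as the input device to read speech from.
-.Ip "\-in mic"
-Microphone input (default).
-.Ip "\-in file"
-Audio waveform file. The format is RAW (16bit big endian), WAV (uncompressed), etc.
-(depending on the compile-time settings).
-.br
-The input file name is entered at a prompt after startup.
-.Ip "\-in adinnet"
-Act as an adinnet server and receive speech data from an adinnet client over the network.
-Wait for a TCP/IP connection from an adinnet client and,
-once the connection is established, receive speech data from that client.
-.br
-The default port number is 5530. It can be changed with option "-port".
-.Ip "\-in netaudio"
-(If supported) receive speech data from a Netaudio/DatLink server.
-The server host name and unit name must be given with "-NA host:unit".
-.Ip "\-in stdin"
-Standard input. Only RAW and WAV data formats are accepted.
-.SH OUTPUT
-Specify one of the following as the output device to which the speech data
-of detected segments is written.
-.Ip "\-out file"
-Output to files. The output file name is given with a separate option, in the form "-filename foobar".
-In practice each segment is recorded under the given name with a four-digit ID appended,
-such as "foobar.0000" , "foobar.0001" .... The ID starts at 0
-and increases by 1 for each detected speech segment. The initial value can be changed
-with option "-startid". The output file format is 16bit WAV.
-To output in RAW format, specify the "-raw" option.
-.Ip "\-out adinnet"
-Act as an adinnet client and send speech data to an adinnet server over the network.
-Conversely to the input case,
-.I adintool
-becomes an adinnet client, connects to the adinnet server, and then sends
-the speech data. Available adinnet servers are the adinnet input of
-.I adintool
-and
-.I Julius.
-.br
-"-server" specifies the host name of the destination adinnet server.
-The default port number is 5530. It can be changed with option "-port".
-.Ip "\-out stdout"
-Output to standard output. The format is RAW, 16bit signed (big endian).
-.PP
-.SH OPTIONS
-.Ip "\-server host[,host...]"
-Host name of the server to send to with "-out adinnet". Multiple servers
-can be given as a comma-separated list.
-.Ip "\-port num[,host...]"
-Port number of the server to send to with "-out adinnet". When using multiple servers,
-specify all port numbers as a comma-separated list matching "-server". (default: 5530)
-.Ip "\-nosegment"
-Do not detect speech segments in the input; redirect it to the output as is.
-With file output, the four-digit ID is no longer appended to the file name.
-.Ip "\-oneshot"
-After input starts, send only the very first speech segment, then exit.
-.Ip "\-freq threshold"
-Sampling frequency, in Hz (default: 16000)
-.Ip "\-48"
-Sample at 48kHz, then downsample to 16kHz.
-.Ip "\-lv threslevel"
-Threshold on the waveform amplitude level (0 - 32767). (default: 2000).
-.Ip "\-zc zerocrossnum"
-Threshold on the number of zero crossings per second (default: 60)
-.Ip "\-headmargin msec"
-Margin just before the start of a speech segment, in milliseconds
-(default: 400)
-.Ip "\-tailmargin msec"
-Margin just after the end of a speech segment, in milliseconds
-(default: 400)
-.Ip "\-nostrip"
-Disable automatic removal of invalid zero samples.
-Automatic removal is enabled by default.
-.Ip "\-zmean"
-Remove the DC component.
-.Ip "\-raw"
-Write file output as RAW, 16bit signed (big endian).
-The default is WAV format.
-.Ip "\-autopause"
-Automatically enter the pause state at each end of input.
-.Ip "\-loosesync"
-When connected to multiple servers, do not synchronize strictly
-on pause and resume commands from those servers.
-.Ip "\-rewind msec"
-Normally adintool ignores input while paused. As a result, if input begins
-during a pause and a server command resumes adintool before that input ends,
-an extremely short speech fragment is detected. With this option,
-if speech input is in progress at resume time, the input is sent starting
-the specified length of time before the resume point.
-.SH "EXAMPLE"
-Record speech input from the microphone, one utterance per file, starting from "data.0000.wav":
-.PP
-    % adintool -in mic -out file -filename data
-.PP
-Split a huge recorded speech file "foobar.raw" by speech segment into
-"foobar.1500.wav" "foobar.1501.wav" ...:
-.PP
-    % adintool -in file -out file -filename foobar
-      -startid 1500
-      (enter the file name at the prompt after startup)
-      enter filename->foobar.raw
-.PP
-Transfer a speech file over the network (no segment detection):
-.PP
-  [receiver]
-    % adintool -in adinnet -out file -nosegment
-.BR
-  [sender]
-    % adintool -in file -out adinnet -server hostname
-      -nosegment
-.PP
-Send speech input from the microphone to
-.I Julius
-on another server:
-.PP
-(1) Send all input data; segment detection and recognition are done on the Julius side:
-.PP
-  [Julius]
-    % julius -C xxx.jconf ... -input adinnet
-.BR
-  [adintool]
-    % adintool -in mic -out adinnet -server hostname
-      -nosegment
-.PP
-(2) Detect segments on the client (adintool) side, and send only the detected
-segments to Julius in turn for recognition:
-.PP
-  [Julius]
-    % julius -C xxx.jconf ... -input adinnet
-.BR
-  [adintool]
-    % adintool -in mic -out adinnet -server hostname
-.SH "SEE ALSO"
-julius(1), adinrec(1)
-.SH BUGS
-Please send bug reports, inquiries, and comments to
-juli****@lists*****.
-.SH COPYRIGHT
-Copyright (c) 1991-2006 Kawahara Lab., Kyoto University
-.br
-Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and Technology
-.br
-Copyright (c) 2005-2006 Julius project team, Nagoya Institute of Technology
-.SH AUTHORS
-Implemented by LEE Akinobu (Nagoya Institute of Technology).
-.SH LICENSE
-Subject to the license terms of
-.I Julius.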
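The revised 00readme.txt in this commit describes two conventions that are easy to get wrong on the command line: the numbered output names produced by -filename and -startid ("file.0000.wav", "file.0001.wav", ...), and the comma-separated -server and -port lists, which need one port per server when several servers are given. A small Python sketch of both follows. The helper names are hypothetical, and the error handling when ports are omitted for several servers is an assumption; the readme only says the ports should be specified in that case.

```python
def segment_filenames(base, count, start_id=0):
    """Names for `count` consecutive segments, as documented for -filename."""
    # Four-digit, zero-padded counter followed by ".wav", starting at start_id.
    return [f"{base}.{start_id + i:04d}.wav" for i in range(count)]


def pair_servers(server_arg, port_arg=None, default_port=5530):
    """Pair a -server list with a -port list, both comma-separated."""
    hosts = [h.strip() for h in server_arg.split(",") if h.strip()]
    if port_arg is None:
        if len(hosts) > 1:
            # Assumption: treat missing ports for several servers as an error.
            raise ValueError("give one port per server when several are listed")
        ports = [default_port] * len(hosts)
    else:
        ports = [int(p) for p in port_arg.split(",") if p.strip()]
    if len(ports) != len(hosts):
        raise ValueError("number of ports must match number of servers")
    return list(zip(hosts, ports))


print(segment_filenames("foobar", 2, start_id=1500))
print(pair_servers("host1,host2,host3", "5530,5530,5531"))
```

The first call corresponds to "-filename foobar -startid 1500" and the second to "-server host1,host2,host3 -port 5530,5530,5531", both taken from the examples in the readme text.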