=================================================================
   IISA

                            (takuya-n@is.aist-nara.ac.jp)
                                               last update 980113
=================================================================

 What is IISA?

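IISA is a tool that uses SUFARY to run full-text search over multiple
files. It can be used to search Web pages, mail, and net news. Each of
these uses is explained later in this document, so please refer to
those sections.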


(Note) Like SUFARY, IISA runs only on UNIX. Using it also requires
       perl 5.

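Technically:

IISA uses SUFARY to perform full-text search over an inverted index.
Because of this, search stays accurate even when the index is built
with a simple method such as splitting on character type, without any
morphological analysis. This in turn makes index creation faster.
(Because IISA is a general-purpose tool, it may perform worse than a
tool specialized for one purpose.) Search results are ranked with
tf*idf.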
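
As a rough illustration of tf*idf ranking, here is a minimal Python
sketch. It is not IISA's code: this document does not give the exact
weighting IISA uses, so the formula (term frequency times
log(N / document frequency)), the function name tfidf_rank, and the
sample documents are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_rank(docs, query_terms):
    """Rank documents by the summed tf*idf weight of the query terms.

    docs maps a document name to its list of index terms.
    (Hypothetical helper; IISA's real formula is not documented here.)
    """
    n_docs = len(docs)

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for terms in docs.values():
        df.update(set(terms))

    scores = {}
    for name, terms in docs.items():
        tf = Counter(terms)
        score = 0.0
        for term in query_terms:
            if tf[term]:
                # A term found in every document gets idf = log(1) = 0.
                score += tf[term] * math.log(n_docs / df[term])
        if score > 0:
            scores[name] = score

    # Highest score first.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


docs = {
    "1": ["suffix", "array", "search"],
    "2": ["mail", "search", "search"],
    "3": ["mail", "news"],
}
ranking = tfidf_rank(docs, ["suffix", "search"])
```

In this sample, document "1" ranks above document "2" because it
contains the rarer term "suffix", and document "3" is dropped because
it contains neither query term.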


 Notation

 $SUFARY: the directory where the SUFARY package was unpacked
 $IISA  : the directory containing IISA (= $SUFARY/iisa ??)


 IISA tools

This section explains how to use each of the IISA tools. Every tool
prints its usage when started with the -h option.

 iisa

The search engine itself.

  usage: iisa [options] keyword
  options:
    -f ..     : set index name (MKIND)
    -s ..     : output separator (----------------------------)
    -nosort   : output without sorting (0)
    -not ..   : negative key word
    -ngram .. : N-gram mode & specify N=(0)
    -moz ..   : MOZ mode & specify mozdic=()

 mkind

The index creation tool. The -m option is given as a Perl regular
expression.

 usage: mkind (options..) <dir>
 options:
   -m ...   : set input file name mask [.]
   -o ...   : set output file name [MKIND]
   -p ...   : set parser [./splitter]
   -l       : output .ary for prefix search [0]
   -help    : show this help

 mgind

The index merging tool. It can be used for index maintenance.

 usage: mgind (options..) <files..>
 options:
   -o ...   : sets output file name [MGIND]
   -help    : shows this help

 splitter

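The tool that produces index entries. It stands in for a morphological
analyzer such as ChaSen or kakasi. It supports splitting on character
type, n-gram splitting, and word splitting (which requires a dictionary
for the morphological analyzer MOZ).

In experiments, index size grew in the order moz < char < ngram(2).
Search speed was fastest in the order char > moz > ngram(2). In other
words, 2-gram mode has no advantage. For search accuracy the mode that
uses the moz dictionary is best, but for ordinary use char mode is
sufficient.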

  usage: splitter [options] < file
  options:
    -char     : character type mode (ON)
    -ngram .. : ngram mode (OFF N=4)
    -moz ..   : moz mode (OFF dic=)
    -help     : show this help
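
To make the char and ngram modes concrete, here is a minimal Python
sketch of the two splitting strategies. It is only an illustration
under assumptions: the character classes in char_type and the function
names are hypothetical, and the real splitter may classify characters
differently.

```python
import unicodedata
from itertools import groupby


def char_type(ch):
    """Classify a character into a coarse type (hypothetical classes)."""
    if ch.isspace():
        return "space"
    if ch.isdigit():
        return "digit"
    name = unicodedata.name(ch, "")
    if name.startswith("HIRAGANA"):
        return "hiragana"
    if name.startswith("KATAKANA"):
        return "katakana"
    if name.startswith("CJK UNIFIED"):
        return "kanji"
    if ch.isalpha():
        return "alpha"
    return "other"


def split_by_char_type(text):
    """Cut the text wherever the character type changes; drop whitespace."""
    tokens = []
    for kind, run in groupby(text, key=char_type):
        if kind != "space":
            tokens.append("".join(run))
    return tokens


def split_ngrams(text, n=2):
    """Return overlapping character n-grams, ignoring whitespace."""
    chars = [c for c in text if not c.isspace()]
    return ["".join(chars[i:i + n]) for i in range(len(chars) - n + 1)]


char_tokens = split_by_char_type("Webページの検索 perl5")
bigrams = split_ngrams("全文検索", 2)
```

Here char_tokens is the list Web, ページ, の, 検索, perl, 5 and bigrams
is the list 全文, 文検, 検索. The n-gram list has nearly one entry per
character, which would fit the larger index size reported above for
ngram(2).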

 num2nx

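The tool that creates the display data file. It is used to specify
what data is shown for each hit when search results are displayed.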

 usage:  num2nx [options]
 options:
   -f ... : set input/output file name [MKIND(.num/.nx,.npt)]
   -c ... : set filter []
   -help  : show this help

 xiisa

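A GUI for iisa built with Perl/Tk. It requires the Tk module for Perl
(to display Japanese, a Japanese localization patch must be applied).
Its functionality is still far from complete.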

 usage: xiisa (options..)
 options:
   -f ...     : set input file [MKIND]
   -filter ...: set filter for viewer [cat]
   -iisa ...  : use [./iisa]
   -help      : shows this help


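 Other tools

Several tools that are handy when using IISA are in $IISA/tools.
IISA itself is a general-purpose tool, so these tools are what you use
to apply it to a specific purpose. They are only samples, so anyone
who uses them seriously should modify them to suit that purpose. For
the ones written in Perl, the "#!/usr/local/bin/perl" line at the top
of the file must be adjusted to match your system.

html2txt: extracts roughly the first 100 characters of an HTML file
iisa.cgi: CGI wrapper script for IISA
serv:     server
client:   client for serv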


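 Example (1): searching mail / net news

I read mail with MH. In that case mail is stored in the directories
under ~/Mail/ in files named 1, 2, 3, ... in order. What follows is
the procedure for searching those files with IISA.

First, create the index files: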

  % $IISA/mkind -m '^\d+' -o mail ~/Mail

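This takes all files under ~/Mail (including subdirectories) whose
names consist only of digits and builds the index
mail.(ix,ix.ary,num) from them.

Next, create the display data file. Showing only the file name is a
bit bare, so let us display the sender and the subject as well: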

  % $IISA/num2nx -f mail -c "|nkf -me | egrep '^Subject|^From'"

The index is now ready. After that, search with either

  % $IISA/iisa -f mail 

or

  % $IISA/xiisa -f mail

as you prefer.


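 Example (2): searching Web pages

Searching HTML files often calls for processing specialized to that
task, so IISA may not be well suited to it. I may eventually make a
specialized tool, something like "IISA for WWW", but since I do not use
it that way myself...

The following is the procedure for indexing the HTML files under the
directory /hogehoge/www/*.

First, create the index files: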

  % $IISA/mkind -m '\.html' -o www /hogehoge/www

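This takes all files under /hogehoge/www (including subdirectories)
whose names contain .html and builds the index www.(ix,ix.ary,num)
from them. If some of the HTML files use the extension htm, use
-m '\.html?' instead.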
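
The effect of the masks used in the two examples can be sketched in
Python as follows. This is an illustration only: it assumes that the
mask is searched for, unanchored, in the bare file name, which this
document does not actually state for mkind. Python's re module stands
in for Perl regular expressions (the two behave the same for these
simple patterns), and select_files and the sample names are
hypothetical.

```python
import re


def select_files(names, mask):
    """Keep the names in which the mask matches anywhere (assumed)."""
    pattern = re.compile(mask)
    return [name for name in names if pattern.search(name)]


mail_names = ["1", "2", "10", "12.bak", ".mh_sequences"]
html_names = ["index.html", "old.htm", "notes.txt", "index.html.bak"]

digits_loose = select_files(mail_names, r"^\d+")    # mask from example (1)
digits_strict = select_files(mail_names, r"^\d+$")  # anchored at both ends
html_only = select_files(html_names, r"\.html")     # mask from example (2)
html_or_htm = select_files(html_names, r"\.html?")  # also accepts .htm
html_strict = select_files(html_names, r"\.html?$") # extension at the end
```

Under that assumption, '^\d+' would also pick up a name such as
12.bak, and '\.html' would also pick up index.html.bak. Adding a
trailing $ to the mask restricts the match to digit-only names or to
names that end in the extension.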

Next, create the display data file:

  % $IISA/num2nx -f www -c "|nkf -e | $IISA/tools/html2txt"

Once you have come this far, check that searching works:

  % $IISA/iisa -f www 

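Put any suitable string in place of the search keyword. If search
results come back, the index has been built successfully.

All that remains is to make iisa usable through CGI. A Perl script
called iisa.cgi is provided as a sample, so adapt it to your
environment as needed. Please note that, as it stands, it cannot do
NOT searches.

In the HTML file that calls the CGI, write the following: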

<form action="bin/iisa.cgi" method="post">
<input type=text size=30 name="input">
<input type=submit value="Search">
<input type=hidden name=start value="0">
</form>
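
For testing the CGI without a browser, it can help to see what this
form sends. The Python sketch below builds the URL-encoded POST body
for the two fields in the form, "input" and "start". The field names
come from the form above; the helper name build_form_body and the
sample keyword are hypothetical, and how iisa.cgi interprets "start"
is not described in this document.

```python
from urllib.parse import parse_qs, urlencode


def build_form_body(keyword, start=0):
    """Encode the form fields as application/x-www-form-urlencoded."""
    return urlencode({"input": keyword, "start": str(start)})


body = build_form_body("suffix array")

# Decoding the body recovers the original field values.
fields = parse_qs(body)
```

Here body is the string "input=suffix+array&start=0", which is what
the CGI script would receive on its standard input for this keyword.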
