-- Dumb Question
Manage Crawls -> Options has forms that allow you to control what pages are crawled. i.e.,
things link the initial seed sites.
Page Options -> Crawl Time tab has a form that allows you to control file types crawled
as well as how pages should be crawled. i.e., things like the maximum number of bytes
downloaded, how to extract a summary from a page, etc. One of things you can select
on this page is which indexing plugins to use. The Word Filter Plugin allows you
to control how to process a page based on the words it contains. You can look
within the Yioop documentation to see the format for its configure page.
Best,
Chris
Manage Crawls -> Options has forms that allow you to control what pages are crawled. i.e.,<br>things link the initial seed sites.<br><br>Page Options -> Crawl Time tab has a form that allows you to control file types crawled<br>as well as how pages should be crawled. i.e., things like the maximum number of bytes<br>downloaded, how to extract a summary from a page, etc. One of things you can select<br>on this page is which indexing plugins to use. The Word Filter Plugin allows you<br>to control how to process a page based on the words it contains. You can look<br>within the Yioop documentation to see the format for its configure page.<br><br>Best,<br>Chris