2012-04-12

Dedicated server for search?.

Originally Posted By: swillis
Do I need to have a dedicated server for this this system or can all functions be performed on the same server as my Intranet site?

If so, what is the subject/function called to perform all functions on the same server as my Intranet site?
'''Originally Posted By: swillis''' Do I need to have a dedicated server for this this system or can all functions be performed on the same server as my Intranet site?<br><br>If so, what is the subject/function called to perform all functions on the same server as my Intranet site?

-- Dedicated server for search?
I wasn't sure what you were asking...
Are you asking if Yioop can be on the same server as your intranet site? Yes.
Are you asking whether Yioop when serving search results requires a separate daemon
to be running in addition to Apache? No.
Are you asking if Yioop can be called via an API from some other PHP application? Yes.
This is describe http://www.seekquarry.com/?c=main&p=doc ... #embedding
under the subsection on Accessing Yioop! via the Function API.

Best,
Chris
I wasn't sure what you were asking...<br>Are you asking if Yioop can be on the same server as your intranet site? Yes. <br>Are you asking whether Yioop when serving search results requires a separate daemon<br>to be running in addition to Apache? No. <br>Are you asking if Yioop can be called via an API from some other PHP application? Yes.<br>This is describe http://www.seekquarry.com/?c=main&p=doc ... #embedding<br>under the subsection on Accessing Yioop! via the Function API.<br><br>Best,<br>Chris
2014-05-14

-- Dedicated server for search?
Originally Posted By: anargeek77
I'm new to this site and SeekQuarry in particular. Anyway my basic question is can I search only within an intranet or local network? The search I am trying to make is just to search for pdf,doc and other files on our local network.
'''Originally Posted By: anargeek77''' I'm new to this site and SeekQuarry in particular. Anyway my basic question is can I search only within an intranet or local network? The search I am trying to make is just to search for pdf,doc and other files on our local network.

-- Dedicated server for search?
Yes, you can restrict Yioop to crawl just your intranet or local network. Right now Yioop can index both pdf and doc but not docx -- docx is actually
an easier format than doc, I just haven't gotten around to coding something for it. It is on my to do list.

Hope this helps.
Yes, you can restrict Yioop to crawl just your intranet or local network. Right now Yioop can index both pdf and doc but not docx -- docx is actually<br>an easier format than doc, I just haven't gotten around to coding something for it. It is on my to do list.<br><br>Hope this helps.

-- Dedicated server for search?
Originally Posted By: anargeek77
Crawling/Search steps. Obviously right now I think I'm doing something wrong with the way I wanted to it to run. So I put some pdf files on a folder under localhost (http://localhost:8888/docs/). (I'm using MAMP just for testing) Everything is running I read about managing crawl and setting the machines.
Then I started to test it to search a keyword in the pdf file example "Blue Origin" and didn't give me any results.

What did I miss? I hope you can help and thanks.
'''Originally Posted By: anargeek77''' Crawling/Search steps. Obviously right now I think I'm doing something wrong with the way I wanted to it to run. So I put some pdf files on a folder under localhost (http://localhost:8888/docs/). (I'm using MAMP just for testing) Everything is running I read about managing crawl and setting the machines.<br>Then I started to test it to search a keyword in the pdf file example &quot;Blue Origin&quot; and didn't give me any results.<br><br>What did I miss? I hope you can help and thanks.
2014-05-21

-- Dedicated server for search?
How did you set your Crawl Options?
Was PDF checked under Page Options?
How did you set your Crawl Options?<br>Was PDF checked under Page Options?
2014-05-27

-- Dedicated server for search?
Originally Posted By: milkingcowboy
PDF is checked under page options, CRAWL OPTIONS well correct me if I am wrong I put this(http://127.0.0.1:8888/docs) under sites to be crawled cause that is where my pdf files are. Are there other settings that I need to do? I put a bunch of pdf files under my docs folder and I can't search any words on those pdf files. I hope you can help me. Thanks!
'''Originally Posted By: milkingcowboy''' PDF is checked under page options, CRAWL OPTIONS well correct me if I am wrong I put this(http://127.0.0.1:8888/docs) under sites to be crawled cause that is where my pdf files are. Are there other settings that I need to do? I put a bunch of pdf files under my docs folder and I can't search any words on those pdf files. I hope you can help me. Thanks!

-- Dedicated server for search?
Originally Posted By: milkingcowboy
BTW I only get 0 urls or 0 crawled pages for it. Even though I let it crawl for an hour.
'''Originally Posted By: milkingcowboy''' BTW I only get 0 urls or 0 crawled pages for it. Even though I let it crawl for an hour.
2014-05-30

-- Dedicated server for search?
If you browse to http://127.0.0.1:8888/docs what do you see? If there are no links to your PDF documents Yioop won't be able to find them.
This could happen if DirectoryIndex was not set up in the Apache config file.
If you browse to http://127.0.0.1:8888/docs what do you see? If there are no links to your PDF documents Yioop won't be able to find them.<br>This could happen if DirectoryIndex was not set up in the Apache config file.
2014-08-17

-- Dedicated server for search?
I've now added docx support to yioop as of commit 63bdfae0. I've also improved support for newer office formats by no longer
using PHP's ZipArchive class and instead rolling my own PartialZipArchive class. This new class let's one extract information/files
from a zip archive (which all modern office documents are) which has only partiall been obtained (say, one just has the first 50000
or 100000 bytes of the document as in a typical Yioop range request).
I've now added docx support to yioop as of commit 63bdfae0. I've also improved support for newer office formats by no longer<br>using PHP's ZipArchive class and instead rolling my own PartialZipArchive class. This new class let's one extract information/files<br>from a zip archive (which all modern office documents are) which has only partiall been obtained (say, one just has the first 50000<br>or 100000 bytes of the document as in a typical Yioop range request).
X