2018-12-19

Encoding error.

Hi Chris. Please help! Yioop v5.0.4. I scan three sites.
  • 1) https://www.asberg.ru/
  • Extracted Title is OK
  • Extracted Description is OK
  • Extracted Links Array is like [https://www.asberg.ru/comp-novostei/novuyu-lineyku-trekhfaznykh-ibp-easy-ups-3s.html] => .. Новая линейка трехфазных ИБП Easy UPS 3S от Schneider Electric
  • Cache is OK
  • 2) https://moscow.rt.ru/
  • Extracted Title is OK
  • Extracted Description is like .. Цифровые услуги и сервисы для вашей семьи Подробнее Будьте спокойны за
  • Extracted Links Array is OK
  • Cache is OK
  • 3) http://www.housecontrol.ru/material.php
  • This site has an uppercase meta tag: <META http-equiv=content-type content="TEXT/HTML; CHARSET=Windows-1251">
  • I changed two lines in the function guessEncodingHtml in the file yioop-v5.0.4/src/library/LocaleFunctions.php:
     $reg = "/charset(\s*)=(\s*)(\'|\")?((\w|\-)+)(\'|\")?/iu";
     mb_ereg_search($reg, "i");
  • The "i" option makes the match case-insensitive.
  • Now scanning site №3 works without errors. But I need your help with site №1 and site №2.
Irina.
(Edited: 2018-12-19)
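
For readers following along, the idea behind the fix for site 3 can be sketched outside Yioop. This is a minimal illustration in Python, not Yioop's PHP code: the names CHARSET_RE and guess_charset are hypothetical, and the pattern is a simplified version of the one quoted above. It only shows why the match has to ignore case when a page declares CHARSET in capitals.

```python
import re

# Hypothetical helper for illustration; this is NOT Yioop's guessEncodingHtml.
# re.IGNORECASE plays the role of the "i" option in the post above.
CHARSET_RE = re.compile(r"""charset\s*=\s*['"]?([\w-]+)""", re.IGNORECASE)


def guess_charset(html, default="utf-8"):
    """Return the declared charset in lowercase, or `default` if none is found."""
    match = CHARSET_RE.search(html)
    return match.group(1).lower() if match else default


# The meta tag from site 3, written entirely in capitals.
upper = '<META http-equiv=content-type content="TEXT/HTML; CHARSET=Windows-1251">'

# A case-sensitive search misses the declaration; the helper finds it.
print(re.search(r"charset\s*=", upper))
print(guess_charset(upper))

# Once the charset is known, the raw bytes can be decoded with it.
raw = "Новая линейка".encode("windows-1251")
print(raw.decode(guess_charset(upper)))
```

If the declaration is missed, a crawler would most likely fall back to a default encoding and decode the Windows-1251 bytes incorrectly, which is one way extracted text ends up garbled.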
2018-12-20

-- Encoding error
Hi Irina,
I just finished the semester here. I will try to see if I can fix the encoding issue for 1 and 2 tomorrow (California Dec 21).
Best,
Chris

-- Encoding error
OK. Thank you. I will wait for your results.
2018-12-27

-- Encoding error
Hi Irina,
I tracked down and fixed the encoding issues you had above. I also incorporated your bug fix for 3 above into the master branch of yioop in the git repository. You can obtain the development branch of yioop by typing:
 git clone https://seekquarry.com/git/yioop.git
Alternatively, you can find different versions of yioop at:
 https://www.seekquarry.com/viewgit/?a=summary&p=yioop
If you want to try to modify version 5.0.4 rather than use the development branch, download the development branch and type:
 git diff c4376bb6 512267d0
to see the changes I needed to make to fix 1 and 2. They were mainly places where I had to repair broken UTF-8 handling. Let me know if these instructions suffice; if not, I can try to write something longer.
Best,
Chris

-- Encoding error
Oh, thanks, Chris. I am very pleased!!!
2018-12-31

-- Encoding error
Hi Chris. Happy New Year and Merry Christmas! I have found one problem in the development branch. The "Resume" link in the "Manage Crawls" menu does not work; it returns an HTTP 500 error. Please help!