  #1 (permalink)  
Old 08-10-2007, 05:36 PM
Junior Member
 
Join Date: Aug 2007
Posts: 2
Zend_Search_Lucene + index created with Java Lucene

Hey!

I'm having some encoding problems with Zend_Search_Lucene and an index created with Java Lucene (v1.9 and 2.0.0). I use a Java app to do the indexing because it's extremely fast, but when I try to retrieve the data stored in the index on my test website using Zend_Search_Lucene, all the special characters (French characters: é, ç, è, à, ù, etc.) are broken.

I tried adding some data to the index using Zend_Search_Lucene, and everything works fine. Only the fields I added to the index with my Java app have their French characters broken.

Has anyone else had this kind of problem? Do I have to use another version of Java Lucene?

Any help would be greatly appreciated.

Regards,

Guillaume
  #2 (permalink)  
Old 08-14-2007, 11:37 AM
Junior Member
 
Join Date: Jul 2007
Posts: 7

As far as I know, Zend_Search_Lucene supports UTF-8, so you should make sure that you index your data as UTF-8 and, when necessary, convert it from its original encoding first.
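To make the "convert it from the correct encoding" step concrete, here is a minimal sketch. It assumes the source data reaches the Java indexer as raw bytes in ISO-8859-1, which nothing in this thread confirms. `SourceTextDecoder` and `decode` are hypothetical names, not part of Lucene or of any existing indexer. The only point is that the charset is named explicitly rather than left to the platform default.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration; not part of Lucene or Zend_Search_Lucene.
final class SourceTextDecoder {

    // Decode raw bytes with the charset the source data was actually written in,
    // instead of relying on the JVM's platform default.
    static String decode(byte[] raw, Charset sourceCharset) {
        return new String(raw, sourceCharset);
    }

    public static void main(String[] args) {
        // "café" as a Latin-1 source would store it: é is the single byte 0xE9.
        // ISO-8859-1 is an assumption here; use whatever the real source encoding is.
        byte[] latin1Bytes = { 'c', 'a', 'f', (byte) 0xE9 };

        String text = decode(latin1Bytes, StandardCharsets.ISO_8859_1);

        // Once the String is correct, encoding it as UTF-8 turns é into two bytes (C3 A9).
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        System.out.println(text + " -> " + utf8.length + " UTF-8 bytes");
    }
}
```

The same idea applies when reading from files or streams: pass the charset to the reader explicitly so the text is already correct before it is handed to the indexer.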

You could open your index in Luke (a Java admin tool for managing and inspecting Lucene indexes) and check whether all the characters are stored correctly there. That will tell you whether it's a Zend_Search_Lucene problem or an indexing problem.

Hope this helps.
  #3 (permalink)  
Old 08-14-2007, 12:25 PM
Junior Member
 
Join Date: Aug 2007
Posts: 2

Hey!

I checked my index with Luke, and you're right: all the French characters in my index are broken. So the problem is in my indexer, not in Zend_Search_Lucene.
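For anyone who lands here with the same symptom, this is a small sketch of what one common kind of breakage looks like. It assumes the typical case of UTF-8 bytes being decoded as ISO-8859-1 somewhere before indexing; the actual cause in my indexer may be different. `MojibakeDemo` and its methods are made-up names for illustration only.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; it only reproduces the symptom, it does not touch a Lucene index.
final class MojibakeDemo {

    // Simulates the assumed bug: text encoded as UTF-8, then wrongly decoded as ISO-8859-1.
    static String misdecodeUtf8AsLatin1(String original) {
        byte[] utf8 = original.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    // Reverses the mistake. This only works if every byte survived the wrong decode.
    static String repair(String broken) {
        byte[] raw = broken.getBytes(StandardCharsets.ISO_8859_1);
        return new String(raw, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String broken = misdecodeUtf8AsLatin1("\u00e9t\u00e9"); // "été"
        System.out.println(broken);          // each é shows up as two characters
        System.out.println(repair(broken));  // back to the original text
    }
}
```

If the values Luke shows have two odd characters wherever an accented letter should be, this pattern is a likely suspect. The real fix belongs in the indexer's input decoding rather than in a repair step after the fact.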

Thank you very much!

Regards,

Guillaume
All times are GMT.