Welcome to NeoOffice developer notes and announcements
NeoOffice
Developer notes and announcements
 
 

This website is an archive and is no longer active
NeoOffice announcements have moved to the NeoOffice News website


Support
· Forums
· NeoOffice Support
· NeoWiki


Announcements
· Twitter @NeoOffice


Downloads
· Download NeoOffice


  
NeoOffice :: View topic - Installer Update 5/4/05 -- Unicode support
Installer Update 5/4/05 -- Unicode support
 
   NeoOffice Forum Index -> NeoLight Development
View previous topic :: View next topic  
Author Message
OPENSTEP
The One
The One


Joined: May 25, 2003
Posts: 4752
Location: Santa Barbara, CA

PostPosted: Wed May 04, 2005 10:07 pm    Post subject: Installer Update 5/4/05 -- Unicode support

I just uploaded a new binary drop, v1.0.2. This version uses double quotes around filenames, which should allow for indexing of files in directory paths with spaces and single quotes in filenames. Probably will fail for double quotes in filenames.

I also changed everything to use full Unicode text extraction. This has fixed some errors involving UTF8 conversion in the plugin. It may also solve problems indexing foreign files. The text item content is now definitely showing up on an mdimport -d3 of files, so this text encoding problem may have been the source of failures on certain files as well.

New binary is at:

http://trinity.neooffice.org/downloads/neolight_installer.pkg.tgz

MD5Sum : 3b7f47fc47dae869f1f9beacd072d762

ed


Last edited by OPENSTEP on Thu May 05, 2005 9:06 am; edited 1 time in total
Back to top
schlesi
Oracle


Joined: Jun 07, 2003
Posts: 234
Location: near Cologne, Germany

PostPosted: Thu May 05, 2005 7:40 am    Post subject:

There's a typo in the link. The right one is:

http://trinity.neooffice.org/downloads/neolight_installer.pkg.tgz.

Thomas
Back to top
OPENSTEP
The One
The One


Joined: May 25, 2003
Posts: 4752
Location: Santa Barbara, CA

PostPosted: Thu May 05, 2005 9:07 am    Post subject:

I stink. Thanks for catching that. I fixed it up above. I think I got it right on the Wiki at least.

ed
Back to top
schlesi
Oracle


Joined: Jun 07, 2003
Posts: 234
Location: near Cologne, Germany

PostPosted: Thu May 05, 2005 9:29 am    Post subject:

I've tested with both, spaces in filepath and german special characters, and it works. Great Wink

Thomas
Back to top
OPENSTEP
The One
The One


Joined: May 25, 2003
Posts: 4752
Location: Santa Barbara, CA

PostPosted: Thu May 05, 2005 9:41 am    Post subject:

Can you run an mdimport -d3 on a file and check to see if you see a "kMDTextContent" tag with the text of the document in it? Does that text also retain special German characters?

ed
Back to top
schlesi
Oracle


Joined: Jun 07, 2003
Posts: 234
Location: near Cologne, Germany

PostPosted: Thu May 05, 2005 10:06 am    Post subject:

Ed,

the text in my file is:
Quote:
Die linden Lüfte sind erwacht,
sie säuseln und weben Tag und Nacht.


The output of the mdimport is
Quote:
SchlesisPowerbook:~ thomas$ mdimport -d3 test.sxw
2005-05-05 19:12:45.556 mdimport[755] Attributes of file '/Users/thomas/test.sxw' before import: {
"_kMDItemImporterCrashed" = <null>;
"com_apple_metadata_modtime" = 137005887;
kMDItemContentCreationDate = 2005-05-05 18:29:17 +0200;
kMDItemContentModificationDate = 2005-05-05 19:11:27 +0200;
kMDItemContentType = "org.neooffice.writer";
kMDItemContentTypeTree = ("org.neooffice.writer", "public.content", "public.data", "public.item");
kMDItemDisplayName = {"" = "test.sxw"; };
kMDItemKind = {"" = "OpenOffice.org Text"; };
}
2005-05-05 19:12:45.558 mdimport[755] Import '/Users/thomas/test.sxw' type 'org.neooffice.writer' using 'file://localhost/Library/Spotlight/neolight.mdimporter/'
/usr/bin/unzip -p "/Users/thomas/test.sxw" "content.xml"
/usr/bin/unzip -p "/Users/thomas/test.sxw" "meta.xml"
2005-05-05 19:12:45.642 mdimport[755] Sending attributes of '/Users/thomas/test.sxw' to server. Attributes: '{
"_kMDItemImporterCrashed" = <null>;
"com_apple_metadata_modtime" = 137005887;
kMDItemContentCreationDate = 2005-05-05 18:29:17 +0200;
kMDItemContentModificationDate = 2005-05-05 19:11:27 +0200;
kMDItemContentType = "org.neooffice.writer";
kMDItemContentTypeTree = ("org.neooffice.writer", "public.content", "public.data", "public.item");
kMDItemDisplayName = {"" = "test.sxw"; };
kMDItemKind = {"" = "OpenOffice.org Text"; };
kMDItemTextContent = "Die linden L\U00fcfte sind erwacht, sie s\U00e4useln und weben Tag und Nacht.";
}'


Thomas
Back to top
OPENSTEP
The One
The One


Joined: May 25, 2003
Posts: 4752
Location: Santa Barbara, CA

PostPosted: Thu May 05, 2005 10:13 am    Post subject:

Sweet! It looks like it got both the text content and the unicode chars for the accents. Gnarly.

Thanks for looking at that. I just didn't have time last night to see if it worked.

ed
Back to top
Display posts from previous:   
   NeoOffice Forum Index -> NeoLight Development All times are GMT - 7 Hours
Page 1 of 1

 
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
You cannot attach files in this forum
You cannot download files in this forum

Powered by phpBB © 2001, 2005 phpBB Group

All logos and trademarks in this site are property of their respective owner. The comments are property of their posters, all the rest © Planamesa Inc.
NeoOffice is a registered trademark of Planamesa Inc. and may not be used without permission.
PHP-Nuke Copyright © 2005 by Francisco Burzi. This is free software, and you may redistribute it under the GPL. PHP-Nuke comes with absolutely no warranty, for details, see the license.