Delphi Inspiration – Delphi Components and Software Applications

Differences

This shows you the differences between two versions of the page.

wiki:wikitaxi:index [2010/02/22 19:25]
admin Explain that Spanish Wikipedia problem is indeed a Wikipedia exporter bug.
wiki:wikitaxi:index [2010/07/20 20:08] (current)
Line 14: Line 14:
  WikiTaxi.exe English.taxi "Albert Einstein"   WikiTaxi.exe English.taxi "Albert Einstein"
 +
 +===== Batch Importing =====
 +
 +Currently WikiTaxi Importer does not support command line arguments and you must pre-compress your XML files with bzip2. You can find a [[WikiTaxi Import Script]] written in AutoIT to save time compressing/importing.
===== Feature Suggestions ===== ===== Feature Suggestions =====
Line 23: Line 27:
> WikiTaxi is Win32 only. There are reports on the Internet that WikiTaxi runs well on Linux using the [[http://www.winehq.org|Wine compatability layer]]. Max OS X or Symbian are unsupported. > WikiTaxi is Win32 only. There are reports on the Internet that WikiTaxi runs well on Linux using the [[http://www.winehq.org|Wine compatability layer]]. Max OS X or Symbian are unsupported.
> >
->> Yes, just tested.  WikiTaxi works great (Import and all!). In Ubuntu install package with Synaptic Package Mgr. Then Wine appears on the bottom of the Applications Menu. Run "Browse C:\" Double Click on WikiTaxi Importer, select file and enter output database name. Click Execute. Close after finish. Go back to Wine C:\ browser, double click on WikiTaxi. Scroll down the opening page 1 screen and click on hyperlink "Open File" browse to the just created Wiki Database from Importer and you are in. Enter search term in top line. Thank you so much for making WikiTaxi! M +>> Yes, just tested.  WikiTaxi works great (Import and all!). In Ubuntu install package with Synaptic Package Mgr. Then Wine appears on the bottom of the Applications Menu. Run "Browse C:\" Double Click on WikiTaxi Importer, select file and enter output database name. Click Execute. Close after finish. Go back to Wine C:\ browser, double click on WikiTaxi. Scroll down the opening page 1 screen and click on hyperlink "Open File" browse to the just created Wiki Database from Importer and you are in. Enter search term in top line. Thank you so much for making WikiTaxi! M
** Full-text search ** ** Full-text search **
> Full text article search requires considerable drive space to store the inverted index. Since the database size is already quite large for some Wikis, there is no article search right now. However, this might change in a future version. > Full text article search requires considerable drive space to store the inverted index. Since the database size is already quite large for some Wikis, there is no article search right now. However, this might change in a future version.
- 
-** <MATH> formulas made with Tex are not displayed ** 
- 
-> Bzreader(another Wiki dump reader) adopted MimeTeX application to enable TeX formula display. It would be better for WikiTaxi to incorporate such similar applications. 
-> 
->> WikiTaxi does not have TeX support. It would be far too complex to implement in a single executable. Know a simple solution? Please share! 
** Some form of image support ** ** Some form of image support **
Line 49: Line 47:
> During WikiTaxi design I kept in mind downloading or updating selected Wiki pages from MediaWiki servers using the Special:Export extension, but this has not been implemented yet. > During WikiTaxi design I kept in mind downloading or updating selected Wiki pages from MediaWiki servers using the Special:Export extension, but this has not been implemented yet.
-** I'd like to see WikiTaxi_Importer.exe work on the command line (and possibly a linux-statically compiled version) to that I can automate the translation process **+** I'd like to see WikiTaxi_Importer.exe work on the command line (and possibly a Linux-statically compiled version) to that I can automate the translation process **
> Noted, but this is low priority only. > Noted, but this is low priority only.
Line 55: Line 53:
** Support for the [[http://meta.wikimedia.org/wiki/ParserFunctions|ParserFunctions]] extension ** ** Support for the [[http://meta.wikimedia.org/wiki/ParserFunctions|ParserFunctions]] extension **
-> WikiTaxi has ever since supported parser functions. If you find that a parser function is missing, please send an e-mail. In particular, the following are included. +> WikiTaxi supports most parser variables, magic words, and parser functions.
-+
->Core parser functions: anchorencode, DEFAULTSORT, DISPLAYTITLE, formatnum, fullurl, lc, lcfirst, ns, padleft, padright, #tag, uc, ucfirst, urlencode. +
-+
->Parser functions: #expr, #if, #ifeq, #iferror, #ifexist, #ifexpr, #switch, #time, #titleparts+
** Support for the importer to work with the uncompressed XML file ** ** Support for the importer to work with the uncompressed XML file **
Line 65: Line 59:
WikiMedia provides these as BZ2 but some wikis (like those on [[http://wikistats.wikia.com/dbdumps/dbdumps.html|Wikia]] and possibly others) provide it as GZ instead. It'd be great if you could just uncompress the XML and use it, rather than having to uncompress, then recompress as BZ2. WikiMedia provides these as BZ2 but some wikis (like those on [[http://wikistats.wikia.com/dbdumps/dbdumps.html|Wikia]] and possibly others) provide it as GZ instead. It'd be great if you could just uncompress the XML and use it, rather than having to uncompress, then recompress as BZ2.
-** Save pages to HTML files ** +** Save pages to HTML files **
Bonus if we can dump the whole contents of the .taxi out as a directory of HTML files for use on obscure platforms. Bonus if we can dump the whole contents of the .taxi out as a directory of HTML files for use on obscure platforms.
Line 105: Line 99:
Especially WP: as a short form for Wikipedia: doesn't work. Especially WP: as a short form for Wikipedia: doesn't work.
 +
 +> The common namespace aliases are implemented as of WikiTaxi 1.2.0. //Ralf//
** Add links to the Headlines (like the edit links in Wikipedia) to jump directly to that section in the source code ** ** Add links to the Headlines (like the edit links in Wikipedia) to jump directly to that section in the source code **
Line 125: Line 121:
> Third party applications or platforms would likely be better off importing the *.xml dump into their specific data structure. If you have a particular idea, you are welcome to share your thoughts at <mail@wikitaxi.org>. > Third party applications or platforms would likely be better off importing the *.xml dump into their specific data structure. If you have a particular idea, you are welcome to share your thoughts at <mail@wikitaxi.org>.
- 
===== Bug Reports ===== ===== Bug Reports =====
Line 134: Line 129:
The Wiki is good if it helps other people to know about a particular problem and especially if you can suggest a workaround. E-Mail is better for intricate problems which might ask for more details to reproduce. Thank you! The Wiki is good if it helps other people to know about a particular problem and especially if you can suggest a workaround. E-Mail is better for intricate problems which might ask for more details to reproduce. Thank you!
</note> </note>
- 
-** Links with percent encoding / URL encoding do not work **  
- 
-Links which contain percent / URL encoded characters (%22) do not work. 
- 
-> This fix requires rewriting substantial parts of WikiTaxi and will be addressed in a future version. 
** On Red-Hyperlinks ** ** On Red-Hyperlinks **
'Red'-hyperlinks (hyperlinks to lemmas that have not been written, i.e. hyperlinks it nowere) in Wikipedia are converted to 'blue'-hyperlinks by WikiTaxi. I would suggest the conversion of the red links to plain text. 'Red'-hyperlinks (hyperlinks to lemmas that have not been written, i.e. hyperlinks it nowere) in Wikipedia are converted to 'blue'-hyperlinks by WikiTaxi. I would suggest the conversion of the red links to plain text.
 +
 +> Not implemented right now for speed. //Ralf//
**Does not Import** **Does not Import**
Line 152: Line 143:
> >
> **Note:** I suspect that your problems result from corrupted *.xml.bz2 file downloads. Please verify their MD5 checksums against those listed on the Wikipedia download page. > **Note:** I suspect that your problems result from corrupted *.xml.bz2 file downloads. Please verify their MD5 checksums against those listed on the Wikipedia download page.
-> 
->> **Note:** Am just trying it now but it seems to make it work to "write the whole path of the .bz2 file manually" ! try it ! so do not use the browse button ! //Anonymous// 
->> 
->>> This seems to point to a Windows Vista file open dialog incompatibility which showed in an earlier version of WikiTaxi. I has since been fixed. Please download the latest version and try again. //WikiTaxi Developer// 
->>>>Still don't work in 1.0.6. on Vista. 
- 
-Same problem on Windows 7 build 7100. Browse button doesn't select file, and manually typing the path\file also fails. 
- 
->>>>>In Properties, choice to run win windows XP mode 
- 
-**problem with the last dump of wikipedia in spanish** 
- 
-I have a problem with the last dump of wikipedia in spanish, the page load and render with another pages inside, I think it's because wikipedia changed the name of the templates, for example when i load an article about a person in the field of age is another article instead the age. I don't know if I explain good, because write in english is too dificult for me now. Thanks in advance. 
**Spanish Wikipedia Problem** **Spanish Wikipedia Problem**
Line 179: Line 157:
>> >>
>>Unfortunately, no date has yet been schedule for a fix. For the time being, the recommended workaround is to download the ''pages-meta-current.xml.bz2'' file instead of the usual ''pages-articles.xml.bz2'' dump. //Ralf// >>Unfortunately, no date has yet been schedule for a fix. For the time being, the recommended workaround is to download the ''pages-meta-current.xml.bz2'' file instead of the usual ''pages-articles.xml.bz2'' dump. //Ralf//
- 
- 
wiki/wikitaxi/index.1266863100.txt.gz · Last modified: 2010/02/22 19:25 by admin