wiki:wikitaxi:index [2010/02/22 19:25] admin Explain that Spanish Wikipedia problem is indeed a Wikipedia exporter bug. |
wiki:wikitaxi:index [2010/07/20 20:08] (current) |
| | | | |
| | WikiTaxi.exe English.taxi "Albert Einstein" | | WikiTaxi.exe English.taxi "Albert Einstein" |
| | + | |
| | + | ===== Batch Importing ===== |
| | + | |
| | + | Currently, the WikiTaxi Importer does not support command-line arguments, and XML files must be pre-compressed with bzip2. A [[WikiTaxi Import Script]] written in AutoIt can save time when compressing and importing. |
| | | | |
| | ===== Feature Suggestions ===== | | ===== Feature Suggestions ===== |
| | > WikiTaxi is Win32 only. There are reports on the Internet that WikiTaxi runs well on Linux using the [[http://www.winehq.org|Wine compatibility layer]]. Mac OS X and Symbian are unsupported. | | > WikiTaxi is Win32 only. There are reports on the Internet that WikiTaxi runs well on Linux using the [[http://www.winehq.org|Wine compatibility layer]]. Mac OS X and Symbian are unsupported. |
| | > | | > |
| - | >> Yes, just tested. WikiTaxi works great (import and all!). In Ubuntu, install the package with Synaptic Package Manager; Wine then appears at the bottom of the Applications menu. Run "Browse C:\", double-click WikiTaxi Importer, select the file, and enter the output database name. Click Execute and close the Importer when it finishes. Go back to the Wine C:\ browser and double-click WikiTaxi. Scroll down the opening screen and click the "Open File" hyperlink, browse to the database just created by the Importer, and you are in. Enter a search term in the top line. Thank you so much for making WikiTaxi! M | + | >> Yes, just tested. WikiTaxi works great (import and all!). In Ubuntu, install the package with Synaptic Package Manager; Wine then appears at the bottom of the Applications menu. Run "Browse C:\", double-click WikiTaxi Importer, select the file, and enter the output database name. Click Execute and close the Importer when it finishes. Go back to the Wine C:\ browser and double-click WikiTaxi. Scroll down the opening screen and click the "Open File" hyperlink, browse to the database just created by the Importer, and you are in. Enter a search term in the top line. Thank you so much for making WikiTaxi! M |
| | | | |
| | ** Full-text search ** | | ** Full-text search ** |
| | | | |
| | > Full text article search requires considerable drive space to store the inverted index. Since the database size is already quite large for some Wikis, there is no article search right now. However, this might change in a future version. | | > Full text article search requires considerable drive space to store the inverted index. Since the database size is already quite large for some Wikis, there is no article search right now. However, this might change in a future version. |
| - | | |
| - | ** <MATH> formulas made with Tex are not displayed ** | |
| - | | |
| - | > Bzreader(another Wiki dump reader) adopted MimeTeX application to enable TeX formula display. It would be better for WikiTaxi to incorporate such similar applications. | |
| - | > | |
| - | >> WikiTaxi does not have TeX support. It would be far too complex to implement in a single executable. Know a simple solution? Please share! | |
| | | | |
| | ** Some form of image support ** | | ** Some form of image support ** |
| | > During WikiTaxi design I kept in mind downloading or updating selected Wiki pages from MediaWiki servers using the Special:Export extension, but this has not been implemented yet. | | > During WikiTaxi design I kept in mind downloading or updating selected Wiki pages from MediaWiki servers using the Special:Export extension, but this has not been implemented yet. |
| | | | |
| - | ** I'd like to see WikiTaxi_Importer.exe work on the command line (and possibly a linux-statically compiled version) so that I can automate the translation process ** | + | ** I'd like to see WikiTaxi_Importer.exe work on the command line (and possibly a Linux-statically compiled version) so that I can automate the translation process ** |
| | | | |
| | > Noted, but this is low priority only. | | > Noted, but this is low priority only. |
| | ** Support for the [[http://meta.wikimedia.org/wiki/ParserFunctions|ParserFunctions]] extension ** | | ** Support for the [[http://meta.wikimedia.org/wiki/ParserFunctions|ParserFunctions]] extension ** |
| | | | |
| - | > WikiTaxi has always supported parser functions. If you find that a parser function is missing, please send an e-mail. In particular, the following are included. | + | > WikiTaxi supports most parser variables, magic words, and parser functions. |
| - | > | + | |
| - | >Core parser functions: anchorencode, DEFAULTSORT, DISPLAYTITLE, formatnum, fullurl, lc, lcfirst, ns, padleft, padright, #tag, uc, ucfirst, urlencode. | + | |
| - | > | + | |
| - | >Parser functions: #expr, #if, #ifeq, #iferror, #ifexist, #ifexpr, #switch, #time, #titleparts | + | |
| | | | |
| | ** Support for the importer to work with the uncompressed XML file ** | | ** Support for the importer to work with the uncompressed XML file ** |
| | Wikimedia provides these as BZ2, but some wikis (like those on [[http://wikistats.wikia.com/dbdumps/dbdumps.html|Wikia]] and possibly others) provide them as GZ instead. It'd be great if you could just uncompress the XML and use it, rather than having to uncompress, then recompress as BZ2. | | Wikimedia provides these as BZ2, but some wikis (like those on [[http://wikistats.wikia.com/dbdumps/dbdumps.html|Wikia]] and possibly others) provide them as GZ instead. It'd be great if you could just uncompress the XML and use it, rather than having to uncompress, then recompress as BZ2. |
| | | | |
| - | ** Save pages to HTML files ** | + | ** Save pages to HTML files ** |
| | | | |
| | Bonus if we can dump the whole contents of the .taxi out as a directory of HTML files for use on obscure platforms. | | Bonus if we can dump the whole contents of the .taxi out as a directory of HTML files for use on obscure platforms. |
| | | | |
| | Especially WP: as a short form for Wikipedia: doesn't work. | | Especially WP: as a short form for Wikipedia: doesn't work. |
| | + | |
| | + | > The common namespace aliases are implemented as of WikiTaxi 1.2.0. //Ralf// |
| | | | |
| | ** Add links to the Headlines (like the edit links in Wikipedia) to jump directly to that section in the source code ** | | ** Add links to the Headlines (like the edit links in Wikipedia) to jump directly to that section in the source code ** |
| | | | |
| | > Third party applications or platforms would likely be better off importing the *.xml dump into their specific data structure. If you have a particular idea, you are welcome to share your thoughts at <mail@wikitaxi.org>. | | > Third party applications or platforms would likely be better off importing the *.xml dump into their specific data structure. If you have a particular idea, you are welcome to share your thoughts at <mail@wikitaxi.org>. |
| - | | |
| | | | |
| | ===== Bug Reports ===== | | ===== Bug Reports ===== |
| | The Wiki is good if it helps other people to know about a particular problem and especially if you can suggest a workaround. E-Mail is better for intricate problems which might ask for more details to reproduce. Thank you! | | The Wiki is good if it helps other people to know about a particular problem and especially if you can suggest a workaround. E-Mail is better for intricate problems which might ask for more details to reproduce. Thank you! |
| | </note> | | </note> |
| - | | |
| - | ** Links with percent encoding / URL encoding do not work ** | |
| - | | |
| - | Links which contain percent / URL encoded characters (%22) do not work. | |
| - | | |
| - | > This fix requires rewriting substantial parts of WikiTaxi and will be addressed in a future version. | |
| | | | |
| | ** On Red-Hyperlinks ** | | ** On Red-Hyperlinks ** |
| | | | |
| | 'Red'-hyperlinks (hyperlinks to lemmas that have not been written, i.e. hyperlinks to nowhere) in Wikipedia are converted to 'blue'-hyperlinks by WikiTaxi. I would suggest the conversion of the red links to plain text. | | 'Red'-hyperlinks (hyperlinks to lemmas that have not been written, i.e. hyperlinks to nowhere) in Wikipedia are converted to 'blue'-hyperlinks by WikiTaxi. I would suggest the conversion of the red links to plain text. |
| | + | |
| | + | > Not implemented right now for speed. //Ralf// |
| | | | |
| | **Does not Import** | | **Does not Import** |
| | > | | > |
| | > **Note:** I suspect that your problems result from corrupted *.xml.bz2 file downloads. Please verify their MD5 checksums against those listed on the Wikipedia download page. | | > **Note:** I suspect that your problems result from corrupted *.xml.bz2 file downloads. Please verify their MD5 checksums against those listed on the Wikipedia download page. |
| - | > | |
| - | >> **Note:** I am just trying it now, but typing the whole path of the .bz2 file manually seems to make it work. Try it, and do not use the browse button! //Anonymous// | |
| - | >> | |
| - | >>> This seems to point to a Windows Vista file open dialog incompatibility which showed in an earlier version of WikiTaxi. It has since been fixed. Please download the latest version and try again. //WikiTaxi Developer// | |
| - | >>>>Still doesn't work in 1.0.6 on Vista. | |
| - | | |
| - | Same problem on Windows 7 build 7100. Browse button doesn't select file, and manually typing the path\file also fails. | |
| - | | |
| - | >>>>>In Properties, choose to run in Windows XP mode. | |
| - | | |
| - | **Problem with the latest dump of the Spanish Wikipedia** | |
| - | | |
| - | I have a problem with the latest dump of the Spanish Wikipedia: pages load and render with other pages embedded inside them. I think this is because Wikipedia changed the names of the templates. For example, when I load an article about a person, the age field shows another article instead of the age. I am not sure I have explained this well, because writing in English is difficult for me. Thanks in advance. | |
| | | | |
| | **Spanish Wikipedia Problem** | | **Spanish Wikipedia Problem** |
| | >> | | >> |
| | >>Unfortunately, no date has yet been scheduled for a fix. For the time being, the recommended workaround is to download the ''pages-meta-current.xml.bz2'' file instead of the usual ''pages-articles.xml.bz2'' dump. //Ralf// | | >>Unfortunately, no date has yet been scheduled for a fix. For the time being, the recommended workaround is to download the ''pages-meta-current.xml.bz2'' file instead of the usual ''pages-articles.xml.bz2'' dump. //Ralf// |
| - | | |
| - | | |
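
Two preparation steps mentioned on this page, verifying a dump's MD5 checksum and recompressing a GZ dump as BZ2 for the importer, could be scripted roughly as follows. This is a minimal Python sketch and not part of WikiTaxi; the function names ''md5_of_file'' and ''gz_to_bz2'' are hypothetical helpers introduced here for illustration only.

```python
import bz2
import gzip
import hashlib
import shutil


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file.

    The file is read in chunks so that multi-gigabyte dumps
    do not have to fit into memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def gz_to_bz2(gz_path, bz2_path):
    """Recompress a .gz dump as .bz2.

    The data is streamed from one compressed file to the other,
    so the uncompressed XML is never written to disk.
    """
    with gzip.open(gz_path, "rb") as source, bz2.open(bz2_path, "wb") as target:
        shutil.copyfileobj(source, target)
```

The digest returned by ''md5_of_file'' can be compared by hand against the checksum published next to the dump; a mismatch suggests a corrupted download. ''gz_to_bz2'' only changes the compression format and leaves the XML content untouched.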