Wikipedia & Linterweb

7 December 2009

Okawix and openZIM:

Filed under: okawix — Matthieu @ 18:16

We spent the week end of November, 22th in Germany, where took place an openZIM developer meeting.

The openZIM project aims to define a new standard for the archiving of Wikimedia content. Based on the ZIM standard, the openZIM format will be open source, so that the users will be given the possibility to browse ZIM files with a variety of ZIM able readers.

Okawix currently uses the Zeno format, which has been a precursor to the ZIM format. Our participation in this meeting aimed to decide if it could be of interest to us to integrate the ZIM format into Okawix, or even to switch from Zeno to ZIM in all of our Wikipedia-related applications.

Indeed, in addition to the development of Okawix, our company, Linterweb, also makes available for download archives of the various Wikipedia projects, in all languages. Presently, those archives are available uniquely as Zeno files, either directly through the Okawix user interface or as BitTorrent links. A switch to an exclusive use of the ZIM format would therefore mean much more to us than a mere integration into Okawix, it would mean deep, substantial changes to the architecture of our applications.

However, after careful consideration, we decided to take the plunge: not only are we going to integrate the ZIM format into Okawix, but we are also going to generate and make available ZIM files. Of course, good things take time, and such changes won’t happen overnight, but the integration into Okawix of the ZIM format should be complete by the end of the year, and ZIM archives of Wikipedia should be available for download by the end of the first trimester 2010.

The openZIM project is approved by the Wikimedia Foundation; it will thus be, after our collaboration with TranslateWiki, our second collaboration with a project supported by Wikimedia.

Yours sincerely, Guillaume, Linterweb developer.


Linterweb is a web company that, for now several years, has been developing various Wikipedia oriented programs, including:

  • Wikiwix, a semantic web search engine that gives only results out of the databases of the Wikimedia Foundation’s projects; My Wikiwix, your own search engine for your own website; wikiwix.mobi, a mobile version of Wikiwix;
  • Okawix, the offline Wikipedia browser free of copyrights and free of charge that allows you to read offline archives of the articles of the various Wikimedia Foundation projects, as well as archives of your own website; those archives are available for download on the website of Okawix;
  • a DVD of around 2000 articles from the English speaking Wikipedia; a USB flash drive that contains the version 0.7 of the English speaking Wikipedia;
  • a program that archives the external web pages of the Wikipedia articles (that is, the web pages outside Wikipedia but linked from a Wikipedia article), so that their content remain available and that those external links don’t get broken; this program is used automatically, in particular, for all external links of the French speaking Wikipedia.

11 Comments »

  1. bonjour,

    WikiTaxi propose des mises à jours regulières vu que l’on recupère les xml directement. J’apprecie votre application et la préfère à wikitaxi. Mais quand est t’il des mises à jours des données ? Peut t’on recuperer directement les xml et les transformer en zeno ?

    merci

    Comment by franck — 9 December 2009 @ 12:02

  2. Bonjour,
    Les mises à jour des .okawix se font automatiquement et régulièrement, à peu près tous les 4 mois. Non, les zenos sont créés à partir des données en cache de wikiwix.com, nous n’avons rien pour transformer les dumps xml de la fondation en zeno.
    Merci

    Comment by Pascal Martin — 9 December 2009 @ 14:26

  3. merci.

    Une autre question : je souhaite avoir okawix sur un disque dur externe ou sur un dvd ( comme je peux le faire avec wikitaxi). J’ai essayé de portabiliser okawix mais sans succès. Pensez vous le portabiliser ?

    Comment by franck — 9 December 2009 @ 16:43

  4. Lots of of people blog about this topic but you said really true words!

    Comment by audinaise — 12 December 2009 @ 6:22

  5. [...] is the original post:  Wikipedia & Linterweb Okawix et openZIM : By admin | category: wikipedia | tags: archivage-des, format-standard, lire-les, [...]

    Pingback by Wikipedia & Linterweb Okawix et openZIM : Wikipedian — 18 December 2009 @ 23:13

  6. great application! thanks for your work.
    ciao

    Comment by gnaffetto — 27 December 2009 @ 19:39

  7. Is the search function enabled in the Mac version 0.7? I’ve tried it on several computers and have yet to get it to search successfully..

    Comment by Gabriel Wyner — 29 December 2009 @ 18:32

  8. Salut à tous ! Je souhaiterais savoir comment visionner avec Okawix la version russe ( uk.wikipedia.okawix ) et la version chinoise ( zh.wikipedia.okawix ). J’ai téléchargé ces 2 versions avec les images ( 1,49Go pour la version russe et 2,62 Go pour la version chinoise) et je me retrouve pour ces 2 cas sur une page blanche. Mon ordinateur est pourtant configuré pour lire les caractères orientaux (XP Pro). Un Bug du logiciel ??? Si quelqu’un a une solution je suis preneur.
    Sinon pour les autres versions ( française, anglaise et espagnole ), c’est vraiment super !!!
    A bientôt les amis !!!

    Comment by Philippe — 11 February 2010 @ 18:29

  9. Bonjour Philippe,
    Tout les problèmes sur les dumps sont à faire sur cette page :
    http://www.okawix.com/beta-test.php?lang=zh pour le chinois, et plus généralement sur http://www.okawix.com/beta-test.php.
    Cordialement
    Pascal

    Comment by Pascal Martin — 12 February 2010 @ 10:16

  10. Merci beaucoup Pascal pour les Infos. En fait, je me suis trompé . C’est la version Ukrainienne qui ne fonctionne pas et non la version russe.
    Salut !!!

    Comment by Philippe — 12 February 2010 @ 20:54

  11. Pour la version Chinoise et Ukrainienne, seule la page d’accueil n’apparaît pas. Sinon dès que l’on effectue une recherche cela fonctionne à merveille.
    A +

    Comment by Philippe — 21 February 2010 @ 11:24

RSS feed for comments on this post. TrackBack URL

Leave a comment

Powered by WordPress