Hacky Thursdays — Back to school Fall 2016

Summer is flying. It’s almost dark outside again when Hacky Thursday time comes around 🙂 After this rather long summer break, it’s time to go back to school. The summer’s announcements were legion, and many, many, many new projects have been donated to the Apache Software Foundation (ASF). I did challenge anyone of you […]

Google #HashCode Hub @EIA-FR

We’re delighted to announce that EIA-FR has agreed to be an official Hub for the next Google #HashCode, on Thursday, Feb 11. Since Thursday is our hacky day, let’s code something different (or not, we’ll see :)). When: Thursday, February 11. Doors open at 6pm and close around midnight. What: Google #HashCode Contest. Where: Room […]

DAPLAB Session #1: Zefix Notifier Project

As announced a few weeks back, we’ll change the layout of the weekly DAPLAB sessions and come up with broader subjects spanning several weeks. The first session we’re proposing is called Zefix Notifier. In a nutshell, we’ll ingest data from the Central Business Names Index of Switzerland (Zentraler Firmenindex, or Zefix for short). We’ll also […]

Spark MLlib Hello World

This page aims to be a “copy-paste”-style tutorial for running your first Spark MLlib script. Requirements: SSH (for Windows, use PuTTY and see how to create a key with PuTTY). An account in the DAPLAB; send your SSH public key to Benoit. A browser. Well, if you can access this page, you should […]

Spark Hello World

A new tutorial is available on docs.daplab.ch. It will guide you through the basics of Apache Spark and its Scala interpreter (spark-shell). Enjoy!

A new framework to simplify interaction with YARN: Apache Twill

YARN, aka NextGen MapReduce, is awesome for building fault-tolerant distributed applications. But writing a plain YARN application is far from trivial and might even be a show-stopper for many engineers. The good news is that a framework to simplify interaction with YARN has emerged and joined the Apache Software Foundation: Apache Twill. While still in the incubation phase, the project looks […]

HDFS Hello World

This page aims to be a “copy-paste”-style tutorial to familiarize yourself with HDFS commands. It mainly focuses on user commands (uploading and downloading data into HDFS). Requirements: SSH (for Windows, use PuTTY and see how to create a key with PuTTY). An account in the DAPLAB; send your SSH public key to Benoit. A browser. Well, […]

Available dataset: homogeneous meteorological data

We give access to homogeneous monthly values of temperature and precipitation for 14 stations from 1864 until today. Yearly values are averaged over the whole of Switzerland since 1864 and are now on the DAPLAB! Data set explanation: The file is a .txt file with a four-row header. MeteoSchweiz / MeteoSuisse / MeteoSvizzera / MeteoSwiss […]