Apache Drill User

Keeping track of Apache Drill. From a geeks, for geeks.

0 notes &

May 2013 updates & heads-up

This month has been exciting so far and promises even more. Here some highlights and announcements:

  • Drill talks
    • We had a very nice Hadoop get together in Berlin on 8 May. Lot’s of questions and good (brain) food.
    • On 14 May I presented Drill at the London HUG and again, huge interest and great discussions. I think it has been recorded. Stay tuned.
    • Then, on 16 May I had a gig in Stuttgart where the slides are now also available.
    • Upcoming: this Friday, on 24 May I’ll give a Drill tutorial at the Cloud East in Cambridge, UK. I suppose there are still some tickets available.
  • Check out ROOT, a distributed query engine from CERN. These guys really rock
  • Again, progress from Julian re operators.

And, as usual: don’t forget to join us at the weekly G+ hangouts at 9am PST / 5pm GMT/ 18:00 CET to discuss progress and issues.

0 notes &

NoSQL matters 2013 in Cologne, Germany—lots of good discussions and great people around for the Apache Drill training day, thank you everybody involved and hope to ‘see’ you on the mailing list, on Twitter or F2F next time!

NoSQL matters 2013 in Cologne, Germany—lots of good discussions and great people around for the Apache Drill training day, thank you everybody involved and hope to ‘see’ you on the mailing list, on Twitter or F2F next time!

0 notes &

Status update April 2013

There are many things happening in parallel at the moment. We’re making great progress in terms of APIs and storage engines.

Also, there are a number of events where Drill is discussed. For example, I recently gave a status talk at the HUG Munich and if you want to start get active in the development, be it via code or test data and test queries, consider joining us on our weekly Google+ hangout at 9am PT/5pm UTC.

Here are the current work-in-progress items:

To get your daily news flash, consider following us on @ApacheDrill and subscribe to one of the mailing lists!

0 notes &

Upcoming Apache Drill talks in Europe

There are a number of talks and sessions about Apache Drill scheduled in the next two months:

1 note &

Apache Drill proposal for Hadoop Summit 2013

Hadoop Summit 2013, Amsterdam

I’ve submitted a session proposal about Apache Drill to the Hadoop Summit 2013 in Amsterdam, The Netherlands.

Please consider supporting the proposal by voting for it at:

http://hadoopsummit2013.uservoice.com/forums/185447-hadoop-futures/suggestions/3400977-understanding-the-value-and-architecture-of-apache

Note that the top vote getters in each track will automatically be added to the Hadoop Summit agenda, so please spread the word and let’s make this happen!

1 note &

Work on query plan design & front-end

Wow, what a bunch of activities in the past two weeks :)

First, work on the query plan has started and a number of important design considerations and decisions have been made.

Then, as of today, the first stab at a Drill front-end (a Web app based on bootstrap.js and jQuery) is available:

  • Check out a walk through video (8min) on YouTube, then
  • try the live demo and maybe
  • … clone and toy around with the source code on GitHub. Note: comments and feature requests are more than welcome!

Apache Drill front-end screen shot 10/2012

1 note &

Design meeting wrap-up & #apachedrill

The Apache Drill Design Meeting on 13 Sep was a huge success, with some 60 people attending. Now, Camuel Gilyadov shared his thoughts re the design considerations and Ted Dunning spoke about Drill on 19 Sep at the Chicago Hadoop User Group - check out the video and his slide set.

Note: we now have an official hashtag #apachedrill - for example, you can search on Twitter for it now.

And last but not least some heads-up: the homepage http://incubator.apache.org/drill/ will soon be online …

1 note &

Ramping up infrastructure & first design meeting

While more and more people are joining the mailing list, first design-related discussions have been taking place.

Design meeting. On 2012-09-13 there was a design meeting with some 60 attendees. Jason Frantz and Julian Hyde presented design guidelines and suggestions. Check out Jason’s slide set, Julian’s slide set and/or watch the meeting.

After the meeting, a number of people signed up to drive certain tasks:

Interface definitions:

  • Jason Frantz
  • Julien Le Dem

SQL front-end & optimization:

  • Nausher Cholavaram
  • Dan

Execution Engine, resource management:

  • Sreevaddi (initial website, jenkins setup, cms, etc.)
  • Paul O’Leary
  • Jason Frantz
  • stoney@gmail.com

Storage (formats, columnar, etc.):

  • Julien LeDem
  • Gera Shegalov
  • Paul O’Leary

Wire formats. Last week I raised an issue re supporting Thrift as a wire format. Turns out that protobuf seems to perform best {1}, {2}, {3} and hence the internal wire format will be protobuf, but Avro and Thrift might well be supported, externally.

Front-end? So, I’m currently working on an Apache Drill front-end (essentially, an HTML5/Ajax browser app like you might know from BigQuery).

1 note &

Start of a journey: Apache Drill

Some 10 days ago I stumbled upon the proposal for the Apache Drill Incubator group. Drill is a distributed system for interactive analysis of large-scale datasets, inspired by Google’s Dremel.

We’re talking about querying hundreds and thousand of billions of records in a few seconds, working against HDFS and supporting a number of nested data wire formats, including JSON, Avro, ProtoBuf, and Thrift.

Since then, quite some things have happened:

  • In a recent GDG meeting, Ryan Boyd from Google and Tomer Shiran from MapR talked about BigQuery - Google’s hosted-Dremel version - as well as about Drill’s positioning, requirements and design decisions. Check out the video from the meeting and Tomer’s slides - his talk starts roughly 1 hour into the video.
  • We now have an Wikipedia entry about Drill.
  • We’re hanging out on Freenode IRC channel #drill for informal discussions.
  • I shared my thoughts about Drill on my WoD blog.

This site, drill-user.org aims at documenting the development of Drill, from its early days of incubating to, hopefully, one day becoming an Apache top-level project and as successful as Hadoop. At least ;)

Consider joining us on this journey. Submissions and comments are more than welcome! For now you might as well sign up on the drill-dev@incubator.apache.org mailing list …

Cheers, Michael