Distributed programming and data consistency
While there is a general perception that all large data processing problems in the world can easily be solved via memcached, a NoSQL database, a Map Reduce solution and thousands of off the shelve computers, there are already many alert signs that it is not so easy (the Twitter whale being just one of them).
There are different data consistency needs and not all space/time data processing graphs are the same.
After understanding how distributed computing is powerful, you may want to know how hard it can be, how to recognize the signs of hardship and have a broad view at the tools available to deal with it.
We will take a look at several data processing problems, separating the easy to partition from the harder ones by pointing to their tell signs. We will then iterate trough solution candidates and respective tradeoffs, from Map Reduce to Locality related optimizations.
If this still sounds to abstract, it means we will also be taking a broad look at (among others) what are and what is the use of:
* Processing Grids;
* Data Grids and "NoSQL" databases;
* MVCC databases, data consistency and compensation mechanisms;
* (The limitations of) Two/Three Phase Commits;
* Coordination, the Paxos Algorithm, Hadoop Zookeeper.
Portuguese
Paulo is a one-job-at-a-time freelance software developer based in Lisbon. Designing and developing software since 1986, he lived and worked abroad for a few years, was an Apache committer (http://commons.apache.org/contributors.html) and performed some international consulting gigs before (finally) becoming a father.
With several programming languages, tools and domains under his belt, he spent the last 10 years mostly developing high traffic web applications and large system integration solutions.
He is currently obsessed with distributed systems focusing on data consistency and data storage related performance while (and because of) leading the development of a "SOAP-to-Scripting (and back)" integration layer. (more)
Thursday, 3 of December of 2009, from 15:00 to 16:00
Alexandre Amaral de Carvalho
André Cruz
André dos Santos Cardoso
Artur Jorge Martins
Bruno Lopes
Bruno Morisson
Bruno Tavares
Caio Proiete
Carlos Rodrigues
Cláudio Valente
Diogo José Melo Alves
Filipe Manuel Lopes de Freitas
Filipe Varela
Flávio Martins
Francisco Lourenço
Gonçalo Araújo
Gonçalo Fortes
Gustavo Funke
Hugo Costa
Jean Figueiredo
João Duarte Gomes
João Eduardo Luís
João Nelas
João Paulo Martins Machado
João Poupino
João Santos
Jorge Filipe Teixeira
Jorge Moura
José Luís Baía Ribeiro
José Manuel Canelas
José Rodrigues da Mata Fernandes
José Vasco Fidalgo Patrício
Luis Neves
Luís Pedro Zamith de Passos Machado Ferreira
Manuel Silva
Marco Neves
Marcos Garcia
Mário Candeias
Mário Morgado
Miguel Figueiredo Mascarenhas Sousa Filipe
Nuno da Costa Ferro
Nuno Marçal
Paula Valenca
Paulo Alexandre Nunes Traça
Paulo Andrade
Pedro Frazão
Pedro Gomes
Pedro Trindade
Ricardo Ferreira
Ricardo Jorge Martins Piedade
Rui Filipe Rocha Leite
Sergio Freire
Sérgio Nunes
Teresa Barros
Tiago Sá
Tomé Duarte
TT*
Estimated head count: 99 people
(based on the total of persons interested in this talk and the universe of people attending Codebits)