Ciao a tutti,
per quelli interessati a sapere di piu su Hadoop, mercoledi 9 terro' un seminario "hands-on" all'universita' di trento. Ecco i dettagli:
* WHEN: Wednesday, November 9th, 2-6pm
* WHERE: CIMeC seminar room on the third floor of Palazzo Fedrigotti (C.so Bettini 31, Rovereto)
* WHO: Elia Bruni (Trento) and Claudio Martella (Amsterdam)
* WHAT: Hadoop - A Hands-on Introduction
The objective of this seminar is to give the audience an overview of the Hadoop framework and a basic understanding on how to write MapReduce programs.
“The Apache™ Hadoop™ project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures.”[http://hadoop.__apache.org http://hadoop.apache.org/]
Concretely we will give an introduction about the architecture of an Hadoop application, why and when it’s useful to use Hadoop and how the programming paradigm works. We will introduce the Hadoop Streaming interface which allows us to write MapReduce programs in any language.
Later we will go through real-world examples from typical NLP problems such as word co-occurrences, cosine similarity, etc. to show how Hadoop can be useful to scale these algorithms over big datasets. The Hadoop Streaming interface gives us the opportunity to work on the examples with languages of our choice (i.e. python, Java).
* Elia Bruni is a PhD student at Center of Language, Interaction andComputation (CLIC) at CIMeC, University of Trento. His research is in the areas of NLP and computer vision.
* Claudio Martella is a PhD student at the Large-Scale Distributed Systems group of the Vrije Universiteit Amsterdam where he works on Complex Networks and Distributed Graph Processing.
Ciao Claudio, qualche info riguardo a iscrizione, prezzi, etc etc?
Grazie, Andrea
On Sun, Nov 6, 2011 at 11:50 PM, Claudio Martella claudio.martella@tis.bz.it wrote:
Ciao a tutti,
per quelli interessati a sapere di piu su Hadoop, mercoledi 9 terro' un seminario "hands-on" all'universita' di trento. Ecco i dettagli:
WHEN: Wednesday, November 9th, 2-6pm
WHERE: CIMeC seminar room on the third floor of Palazzo Fedrigotti
(C.so Bettini 31, Rovereto)
WHO: Elia Bruni (Trento) and Claudio Martella (Amsterdam)
WHAT: Hadoop - A Hands-on Introduction
The objective of this seminar is to give the audience an overview of the Hadoop framework and a basic understanding on how to write MapReduce programs.
“The Apache™ Hadoop™ project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures.”[http://hadoop.__apache.org http://hadoop.apache.org/]
Concretely we will give an introduction about the architecture of an Hadoop application, why and when it’s useful to use Hadoop and how the programming paradigm works. We will introduce the Hadoop Streaming interface which allows us to write MapReduce programs in any language.
Later we will go through real-world examples from typical NLP problems such as word co-occurrences, cosine similarity, etc. to show how Hadoop can be useful to scale these algorithms over big datasets. The Hadoop Streaming interface gives us the opportunity to work on the examples with languages of our choice (i.e. python, Java).
- Elia Bruni is a PhD student at Center of Language, Interaction
andComputation (CLIC) at CIMeC, University of Trento. His research is in the areas of NLP and computer vision.
- Claudio Martella is a PhD student at the Large-Scale Distributed
Systems group of the Vrije Universiteit Amsterdam where he works on Complex Networks and Distributed Graph Processing.
-- Claudio Martella Free Software & Open Technologies Analyst
TIS innovation park Via Siemens 19 | Siemensstr. 19 39100 Bolzano | 39100 Bozen Tel. +39 0471 068 123 Fax +39 0471 068 129 claudio.martella@tis.bz.it http://www.tis.bz.it
Short information regarding use of personal data. According to Section 13 of Italian Legislative Decree no. 196 of 30 June 2003, we inform you that we process your personal data in order to fulfil contractual and fiscal obligations and also to send you information regarding our services and events. Your personal data are processed with and without electronic means and by respecting data subjects' rights, fundamental freedoms and dignity, particularly with regard to confidentiality, personal identity and the right to personal data protection. At any time and without formalities you can write an e-mail to privacy@tis.bz.it in order to object the processing of your personal data for the purpose of sending advertising materials and also to exercise the right to access personal data and other rights referred to in Section 7 of Decree 196/2003. The data controller is TIS Techno Innovation Alto Adige, Siemens Street n. 19, Bolzano. You can find the complete information on the web site www.tis.bz.it.
Ciao,
On 8 November 2011 09:58, andrea antonello andrea.antonello@gmail.com wrote:
Ciao Claudio, qualche info riguardo a iscrizione, prezzi, etc etc?
Lasciando a Claudio la risposta finale, a naso e AFAIK, trattandosi di un seminario universitario dovrebbe essere ad accesso libero senza bisogno di iscrizione?
Ciao Steevie,
qualche info riguardo a iscrizione, prezzi, etc etc?
Lasciando a Claudio la risposta finale, a naso e AFAIK, trattandosi di un seminario universitario dovrebbe essere ad accesso libero senza bisogno di iscrizione?
non avevo notato la questione universitaria, e' vero.
Nella letta (evidentemente troppo veloce) avevo avuto l'impressione che fosse hands on nel senso che i partecipanti provano gli strumenti (nel qual caso non posso immaginare che si possa fare senza organizzare le iscrizioni). Mi sbaglio?
Grazie, Andrea
-- Stefano David _______________________________________________ http://lists.lugbz.org/cgi-bin/mailman/listinfo/lugbz-list
Ciao Andrea,
il seminario e' gratuito, non vi e' nessuna richiesta di iscrizione, si richiede solo che i partecipanti confermino l'eventuale presenza (ci aiuta nella pianificazione della stanza), portino il loro portatile e seguano delle semplici istruzioni per installare hadoop prima di mercoledi.
Se avessi intenzione di presenziare fammi sapere che ti mando il pdf.
Grazie! Claudio
On 11/8/11 9:58 AM, andrea antonello wrote:
Ciao Claudio, qualche info riguardo a iscrizione, prezzi, etc etc?
Grazie, Andrea
On Sun, Nov 6, 2011 at 11:50 PM, Claudio Martella claudio.martella@tis.bz.it wrote:
Ciao a tutti,
per quelli interessati a sapere di piu su Hadoop, mercoledi 9 terro' un seminario "hands-on" all'universita' di trento. Ecco i dettagli:
WHEN: Wednesday, November 9th, 2-6pm
WHERE: CIMeC seminar room on the third floor of Palazzo Fedrigotti
(C.so Bettini 31, Rovereto)
WHO: Elia Bruni (Trento) and Claudio Martella (Amsterdam)
WHAT: Hadoop - A Hands-on Introduction
The objective of this seminar is to give the audience an overview of the Hadoop framework and a basic understanding on how to write MapReduce programs.
“The Apache™ Hadoop™ project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures.”[http://hadoop.__apache.org http://hadoop.apache.org/]
Concretely we will give an introduction about the architecture of an Hadoop application, why and when it’s useful to use Hadoop and how the programming paradigm works. We will introduce the Hadoop Streaming interface which allows us to write MapReduce programs in any language.
Later we will go through real-world examples from typical NLP problems such as word co-occurrences, cosine similarity, etc. to show how Hadoop can be useful to scale these algorithms over big datasets. The Hadoop Streaming interface gives us the opportunity to work on the examples with languages of our choice (i.e. python, Java).
- Elia Bruni is a PhD student at Center of Language, Interaction
andComputation (CLIC) at CIMeC, University of Trento. His research is in the areas of NLP and computer vision.
- Claudio Martella is a PhD student at the Large-Scale Distributed
Systems group of the Vrije Universiteit Amsterdam where he works on Complex Networks and Distributed Graph Processing.
-- Claudio Martella Free Software & Open Technologies Analyst
TIS innovation park Via Siemens 19 | Siemensstr. 19 39100 Bolzano | 39100 Bozen Tel. +39 0471 068 123 Fax +39 0471 068 129 claudio.martella@tis.bz.it http://www.tis.bz.it
Short information regarding use of personal data. According to Section 13 of Italian Legislative Decree no. 196 of 30 June 2003, we inform you that we process your personal data in order to fulfil contractual and fiscal obligations and also to send you information regarding our services and events. Your personal data are processed with and without electronic means and by respecting data subjects' rights, fundamental freedoms and dignity, particularly with regard to confidentiality, personal identity and the right to personal data protection. At any time and without formalities you can write an e-mail to privacy@tis.bz.it in order to object the processing of your personal data for the purpose of sending advertising materials and also to exercise the right to access personal data and other rights referred to in Section 7 of Decree 196/2003. The data controller is TIS Techno Innovation Alto Adige, Siemens Street n. 19, Bolzano. You can find the complete information on the web site www.tis.bz.it.