Monthly Archives: March 2014

Contanalytic team’s presentation at Silverline Mobile Behaviour Challenge

At 7:00 pm (GMT+8) on 27 March, 2014 at 237 South Bridge Road, Singapore, The Silverline Mobile Predictive Behavior Challenge of DEXTRA which came to conclusion. Contanalytics team had a good chance to present our ideas and model before a … Continue reading

Posted in Data Analysis | Tagged , , , | Leave a comment

#115 – Final Project II

(Up to 30 hours) Criteria Group work Work up to Tutorial 114 Facebook III – Keyword Extraction Challenge on Kaggle (http://www.kaggle.com/c/facebook-recruiting-iii-keyword-extraction) Large Scale Hierarchical Text Classification (http://www.kaggle.com/c/lshtc) Yandex – Personalized Web Search Challenge (http://www.kaggle.com/c/yandex-personalized-web-search-challenge) Million Song Dataset Challenge (http://www.kaggle.com/c/msdchallenge) Topics … Continue reading

Posted in Hadoop, Internship | Tagged | Leave a comment

#114 – Apache Mahout 3 – Building Clustering Systems

(Expected 8 hours) Preparation A machine with Mahout Mahout in Action (MiA) book MiA sample code Twitter, Last.fm and Stack Overflow datasets: Link One-site support Work closely with the intern since the set up process is not straightforward Find similar … Continue reading

Posted in Hadoop, Internship | Tagged , | Leave a comment

#113 – Apache Mahout 2– Building Recommenders

(Expected hours: 8) Preparation A machine with Mahout Mahout in Action (MiA) book MiA sample code One-site support Work closely with the intern since the set up process is not straightforward Analyze Wikipedia dataset Design a distributed item-based algorithm Implement … Continue reading

Posted in Hadoop, Internship | Tagged , , | Leave a comment

#112 – Setting Up Mahout

(Expected hours: 8) Preparation Mahout in Action (MiA) book Sample Code of MiA A machine with Java, IntelliJ and Hadoop installed is preferred. One-site support Work closely with the intern since the set up process is not straightforward Setting up … Continue reading

Posted in Hadoop, Internship | Tagged | Leave a comment

March 2014 Oflline

After the Big Data presentation in University of Natural Science, March 15, 2014 Contanalytics team organized the meeting to introduce about Hadoop at Contemi Vietnam company. The meeting start at 9:30 and end at 11:00 with the participation of about … Continue reading

Posted in Hadoop, Internship | Tagged , , | Leave a comment

#111 – Hadoop Map-Reduce – Design Pattern 2

Preparation Hadoop 1.2.1. References ebook: http://www.mediafire.com/view/zpy37nou8v516f2/mapreduce_design_patterns.pdf Ebook’ s related source code: https://github.com/adamjshook/mapreducepatterns Ebook’ s related data: https://www.dropbox.com/s/icdwzmfmkeu7u9i/data1_comments.xml One-site support Answer questions about design pattern filtering. Answer questions about HDFS. Answer questions about the test’s request.   Verification Setup example “hot … Continue reading

Posted in Hadoop, Internship | Tagged , , | Leave a comment

#110 – Hadoop Map-Reduce – Design Pattern 1

Preparation Hadoop 1.2.1. References ebook: http://www.mediafire.com/view/zpy37nou8v516f2/mapreduce_design_patterns.pdf Ebook’ s related source code: https://github.com/adamjshook/mapreducepatterns Ebook’ s related data: https://www.dropbox.com/s/icdwzmfmkeu7u9i/data1_comments.xml   One-site support Answer questions about Map-Reduce model. Answer questions about design pattern summarization. Guidance to run source code in chapter 3. Answer … Continue reading

Posted in Hadoop, Internship | Tagged , , , | Leave a comment

Buổi giới thiệu Big Data tại trường Đại Học Khoa Học Tự Nhiên thành phố Hồ Chí Minh

Contanalytics team rất vui vì với sự cho phép của TS. Lý Quốc Ngọc trưởng bộ môn thị giác máy tính và khoa học Robot của trường ĐH Khoa Học Tự Nhiên, chúng tôi đã có buổi giới thiệu Big … Continue reading

Posted in Big Data, Internship, Offline, Đại Học Khoa Học Tự Nhiên | Tagged , , | Leave a comment