Tuesday, January 28, 2014

Big Data course at CVUT

Thanks to IBM support we will open new course in Big Data at CVUT FEL. The course will offer hands on experience in using the standard Big Data methods such as Hadoop. To exercise the hand on experience we have prepared several text processing tasks. 

Why is important to teach Big Data? The size of processed data is constantly growing. Internet portals, insurance companies, banks, GSM providers, health industry, automotive etc. are accumulating enormous data on their servers. The data contains lot of various information. The new methods for processing the data, understanding the data are being developed at similar pace. It is clear that companies without a large analytical departments and access to the Big Data will not be able to make good decisions. They will not be competitive. This is leading the companies to look for a new experienced people capable of processing and interpreting the data. The role of the university is to be ahead and react on these demand.

The motivation for this course is clear to IBM and the university too and this was the reason to join forces.

The objective of our course is to teach the students the Big Data basics and offer some hands-on experience. The course will focus on methods for extraction, analysis as well as selection of hardware infrastructure for managing persistent data. In the second half of the course we will show how to process streamed data, such as data from social networks. As exercise we will introduce standard analytical methods for text processing.

The course is split in to 13 weeks. We want to cover five main topics:

  1. Hadoop overview - all components and how they work together. Install Hadoop, HW requirements, SW requirements, how to administer, introduce to the basic setup of our cluster.
  2. MapReduce, how to use pre-installed data. The bag of words notion, TF-IDF,  SVD, LDA. 
  3. HDFS, NoSQL databases, HBase, SQL access, Hive,  How to upscale-downscale HDFS. 
  4. What is Mahout, what are the basic algorithms. Run random forest classification task using the Mahout algorithms.
  5. Streamed data – Storm or InfoSphere, real time processing using the Twitter data, simple sentiment algorithm

We will put all the presentations on the web with public access, they all will be in English. You can follow us on the course web pages. Keep the fingers crossed for us, it will be a lot of work but we all are looking forward to play with the latest technologies.


  1. Thanks for the information! A week ago I was at big data training in Singapore. It was really awesome experience for me! Highly recommend everybody to visit such events. You will get a lot of new ideas for your future projects!

  2. Thank you for your post. This is excellent information. It is amazing and wonderful to visit your site.
    Mobile app training institutes

  3. Needed to compose you a very little word to thank you yet again regarding the nice suggestions you’ve contributed here.

    big data training in chennai

  4. It isn't always against the law to are looking for assist and thoughts from folks who keep enjoying inside the difficulty be counted number or writing such paper. Doctoral dissertation help writer is the maximum asked professional dissertation writing provider within the modern-day information of instructional writing services.

  5. There had been instances while the garbage became accumulated in huge garbage cans. Casting off this rubbish come to be an actual mission. Top garbage disposal is accumulated in massive quantity after every few hours. As a stop result, it's miles important to keep it a protracted way some distance from the kitchen then this is first-rate for all chef and housewife.

  6. It's miles often tough for the students to offer you the best undertaking papers at the same time as handling to include all the crucial factors. Pay to do my assignment Australia is a pleasing educational writing consultancy enterprise business enterprise. We've got lots of grad college students from worldwide thru which they may assign us their assignment and we can entire before the day of the cut-off date.


  7. It’s great to come across a blog every once in a while that isn’t the same out of date rehashed material. Fantastic read.

    Digital Marketing Training in Mumbai

    Six Sigma Training in Dubai

    Six Sigma Abu Dhabi

  8. Great post and informative blog.it was awesome to read, thanks for sharing this great content to my vision.
    Good discussion.
    Six Sigma Training in Abu Dhabi
    Six Sigma Training in Dammam
    Six Sigma Training in Riyadh

  9. I found your post while searching for some related information on blog search... Its a great blog, keep posting and update the information.
    Hadoop Training Chennai
    Hadoop Training in Chennai
    Big Data Training in Chennai
    Big Data Training
    Selenium Training in Chennai

  10. This comment has been removed by the author.

  11. Outstanding blog thanks for sharing such wonderful blog with us ,after long time came across such knowlegeble blog. keep sharing such informative blog with us.
    Airport Management Courses in Chennai | Airport Management Training in Chennai | Airline Courses in Chennai | Airport Courses in Chennai | Airline and Airport Management Courses in Chennai

  12. Thanks for sharing this valuable information to our vision. You have posted a worthy blog keep sharing.
    English Speaking Classes in Mumbai
    English Speaking Course in Mumbai
    Spoken English Training in Bangalore

  13. Very informative blog ,Very good information thanks for sharing such wonderful blog with us ,after long time came across such knowlegeble blog. keep sharing such informative blog with us.
    install free SSL certificate | ssl certificate setup for wordpress on google cloud |
    google cloud platform | google cloud wordpress