|Date:||05.03.2019 13:00 - 05.03.2019 14:00|
|Location details:||Join webinar at: https://cscfi.zoom.us/j/226451230|
|Lecturers:|| Apurva Nandan / CSC |
Data is everywhere and it is growing! The growth of data volume has led to many challenges when it comes to processing it. People encounter problems with low CPU/Memory whenever they have tried to analyze unmanageable amount of data. Coupled with the fact, that the time needed for the processing is very high. Let's face it - waiting forever for a job to complete or starting it all over again, if it fails, is never fun. Enter Spark, a high-performance distributed computing framework, allows us to tackle big-data problems by distributing the workload across a cluster of machines, making those workflows painless.
This webinar is a part of the CSC's big data series. We will discuss briefly about Spark, and how to deploy it on CSC's infrastructure.
Prerequisities: The webinar is for everyone interested in the subject.
Joining the webinar: Join the webinar with Zoom: https://cscfi.zoom.us/j/226451230
Tips: At the first time, Zoom asks you to download a launcher plug in. After this, you should be able to open the webinar room from the link. The webinar room is timed, and will open when the webinar begins. When signing in, remember to click "join audio conference by computer". If you are using a headset, it is easiest to plug them in before opening Zoom.