Big Data Analytics (Specialisation studies)

Further education / Technology / Business

These studies offer you research at the cutting edge of creating intelligent services, working alongside scholars and industry in their area. The studies are grounded in the scope and theories of machine learning and the challenges of analytical service design. You will develop detailed knowledge and understanding of how these disciplines blend together in tackling the real problems that face organisations.

Price
600 € + 0% VAT
Conductor
Arcada University of Applied Sciences
Language
English
Scope
30 ECTS
Course dates
29.08.2024-09.05.2025
Last application date
15.06.2024
Entry requirements
An engineering degree or other formal science degree that the UAS otherwise deem sufficient. Prior work experience in software engineering. Read more about our knowledge recommendations in the course description.

Please notice that these studies are conducted as part-time studies and therefore doesn't qualify for a residence permit in Finland.

Educational level for these studies: Master (EQF7)

Salvatore della Vecchia
Former student of BDA at Arcada

One thing I really appreciate is that the education isn’t just theoretical, but we get to work with real data sets. I’m just about to start the last course where we work with actual data from a real company, which is both really interesting and the best way to learn. I appreciate Arcada’s approach to education, that it’s important to study the theory, but equally important to get to do real programming and utilize what you’ve learned. This was something that I missed at my university in Italy, where the studies were mainly theoretical.

Salvatore della Vecchia
Former student of BDA at Arcada

Of course the studies are intense and you have to be prepared to work hard, but it’s worth it. The content has been great. I’ve especially enjoyed going through visual analytics techniques and advanced data mining – something that is hard to find a place to study, since not many are using these advanced techniques yet. To sum it up, yes, I would definitely recommend these studies to others!

Today the same amount of information is created by computers and users around the world in 48 hours, as was created from the beginning of humanity until year 2003. Software encompasses an increasing amount of industries, and as Internet of Things (IoT) solutions become more common, big data analytics must be used to process the data they generate. To find patterns in and being able to handle this enormous amount of data opens for better and more precise fact-based decision-making. Companies and organisations have already realized the opportunities this brings and there is a continuously growing demand of trained big data analytics developers today.

Big Data Analytics specialisation studies gives you an in-depth understanding of how to make use of data in order to create insights. The intended student for these specialisation studies is someone with a programming background who wants to understand how to employ machine learning methods in a business environment. The program is arranged so that you rapidly gain a broad understanding of the essential concepts of big data analytics; descriptive and predictive modelling, as well as optimization. The program emphasizes the importance of understanding how to build analytical solutions with production level code.

As a student you will be involved in projects connected to real-world problems that includes elements of both group work and individual achievements. The focus on real-world problems emphasizes disruptive problem solving through analytics service development. Communication and business acumen is emphasized both through data visualization and traditional pitching.

Are you looking for our Master's degree programme in Big Data Analytics? Read more here.

Study alongside work

The specialisation studies in Big Data Analytics are tailored so that you can attend them alongside full-time employment.

Programme contents

This programme will cover the following subjects.

Introduction to Analytics, 5 ECTS
Machine Learning for Predictive Problems, 5 ECTS
Visual Analytics, 5 ECTS
Machine Learning for Descriptive Problems, 5 ECTS
Big Data Analytics, 5 ECTS
Analytical Service Development, 5 ECTS

The curriculum consists of six courses (5 ECTS credits each) that focus on how to program intelligent services using analytical and machine learning methods. Each of the courses are thought through solving programming problems were the students can make use of Arcada’s Nvidia-sponsored Big Data Lab or computational resources from the Finnish Supercomputing Center (CSC). These tools will enable students to learn how to make use of GPU-accelerated models for processing big data.

Date	Module
Date:	29.8-27.9-2024	Module:	Introduction to Analytics
Date:	10.10-8.11.2024	Module:	Machine Learning for Predictive Problems
Date:	21.11-20.12.2024	Module:	Visual Analytics
Date:	16.1-14.2.2025	Module:	Machine Learning for Descriptive Problems
Date:	27.2-28.3.2025	Module:	Big Data Analytics
Date:	10.4-9.5.2025	Module:	Analytical Service Development

Lectures will be held every other week on Thursdays and Fridays from 13 to 18 on-site.

For more information about these modules see the curriculum below.

Curriculum for Big Data Analytics

Introduction to Analytics (Module 1)

The aim of the course is to introduce the student to the different concepts of implementing an analytics process. Students learn the process of problem solving in analytics from data understanding and preprocessing, through modelling choices and implementation until the interpretation, visualization and utilization of the analysis. We will look at typical real-life

applications of analytics. The course will provide hands-on lectures to performing the steps of modeling and analysis.

Machine Learning for Predictive Problems (Module 2)

You learn a practical approach for predicting with Machine Learning with all the steps starting from data acquisition and preparation, search for optimal parameters, to comparison of different methods and evaluation of results. You can employ a linear model for regression and classification, train neural networks, build data features with deep learning, represent and process natural text with numerical methods.

Visual Analytics (Module 3)

You learn how to lead in turbulent times through data-driven management. You also learn to understand how to become an agent of change for transforming data into insights. People are often visual beings and therefore the focus of the course is on reducing information, through algorithms, that can then be visualized. You develop an understanding of visual analytical methods as a communication medium for business intelligence.

Machine Learning for Descriptive Problems (Module 4)

You learn to efficiently handle massive datasets and extract hidden knowledge from data. You understand how to employ classification and clustering algorithms on three different types of data: text, streaming and graph. You acquire knowledge of methods and programming tools for processing big data on distributed/cloud systems.

Big Data Analytics (Module 5)

You are given an overview of machine learning and how to utilize big data. The methods for descriptive and predictive modelling are introduced for small data, and you are then given an explanation for how similar models can be modified to work with big data. You will be introduced to the analytical process; big data tooling, data-related requirement handling, domain knowledge, modelling and verification of results.

Analytical Service Development (Module 6)

You develop an understanding for planning the analytical process; data-related requirement handling, domain knowledge/modelling expertise and verification of results. Each student completes an industry cap-stone project as part of the course.

Knowledge recommendations for Big Data Analytics

To ensure that you can fully benefit from our specialisation program in Big Data Analytics, we recommend that you have basic knowledge of Python and experience with using the Linux command line. If you are already familiar with these, our program will offer in-depth knowledge and skills that can help you to bring your expertise to the next level. We strive to create a supportive and inspiring environment for our students.

The recommended knowledge includes:

Basic Python

Basic syntax and data types (None, bool, int, float, str, list, tuple, dict).
Understanding the concept of variables and variable scope.
Control structures (if-else, for and while loops).
Ability to write simple functions and use them in programs.
Basic Python libraries like math, random, as well as understanding and utilizing classes from the datetime module.
Ability to handle basic string and list methods for manipulation. For example:

For strings:
- upper(): Converts all the characters in a string to uppercase.
- lower(): Converts all the characters in a string to lowercase.
- strip(): Removes leading and trailing whitespace from a string.
- split(): Splits a string into a list where each word is a list item.
- replace(): Replaces a specified phrase with another specified phrase.
For lists:
- append(): Adds an element at the end of the list.
- insert(): Adds an element at the specified position.
- remove(): Removes the first item with the specified value.
- pop(): Removes the element at the specified position.
- sort(): Sorts the list.
Reading from and writing to files.
Handling errors and exceptions using try/except blocks.
An understanding of Python's logging system.

Intermediate

Handle regular expressions for pattern matching in strings.
Understand object-oriented programming: classes, objects, methods.
Understand and use list comprehensions and lambda functions.
Understand Python's memory management and optimization techniques.

Libraries

Familiarity with the following Python libraries would be beneficial:

NumPy: For numerical computations.
Pandas: For data manipulation and analysis.
Matplotlib: For data visualization.

Your code writing style should be clear, concise, and efficient. Avoid excessive code repetitions by adhering to the DRY (Don't Repeat Yourself) principle. This means that information is not duplicated, and you use appropriate structures to encapsulate repeated code. This not only reduces redundancy but also improves readability, maintainability, and scalability of the code. Code you produce should be self-explanatory, with meaningful variable, function, and class names. Comments should be used to explain the 'why' rather than the 'what' or 'how'. This promotes consistency and makes the code easier to understand and debug. Remember, code is read more often than it is written, so strive for clarity and simplicity.

Basic Linux Terminal Commands

Being comfortable with basic Linux terminal commands is essential to interact with Linux-based systems efficiently. These skills are indispensable for software and data engineers, whom our program is targeting. Knowledge of these terminal commands ensures a deeper understanding of computer systems and equips you with the tools needed for professional development in technology fields.

man: Display the user manual of a command
ssh: Secure shell remote login.
ls: List directory contents.
cd: Change the current directory.
pwd: Print the name of the current directory
cp: Copy files and directories.
mv: Move or rename files and directories.
rm: Remove files and directories.
cat: Concatenate and display file content.
echo: Display a line of text.
head: Output the first part of files.
tail: Output the last part of files.
grep: Search for a specific pattern within files.
find: Search for files in a directory hierarchy.
chmod: Change the permissions of files or directories.
chown: Change the owner and group of files or directories.
df: Report file system disk space usage.
du: Estimate file and directory space usage.
tar: Archive files.
gzip: Compress or expand files.
ps: Report a snapshot of the current processes.
top: Display Linux tasks.
kill: Send a signal to a process.
curl: Transfer data to or from a server.
scp: Securely copy files between a local host and a remote host or between two remote hosts.
rsync: Only transfers the changes made rather than transferring all the files again.

You should be able to use these Linux commands comfortably and understand their options and parameters. You should also understand the concept of Linux file permissions and know how to use pipes (|) and redirection (>, >>, <).

Contact us about the course

Ulla Nyberg

Career Counsellor

E-mail:ulla.nyberg@arcada.fi

Phone number:0294 282 553