macau university of science and technology
master in applied mathematics and data science
basic core courses
mimz01 mathematics methods for data science (3 credits)
this course provides some topics including various basic mathematics methods commonly used in optimization. we will cover basic definitions, concepts, and results from convex analysis and convex optimization, a variety of applications of convex optimization, in areas like probability and statistics, computational geometry, and data fitting. we will describe numerical methods for solving convex optimization problems, focusing on newton’s algorithm and interior-point methods.
mimz02 numerical linear algebra (3 credits)
this course supplies an introduction to the basics of linear algebra. then the course provides some common topics in numerical computation. such as, conditioning of problems and stability of algorithms、 gaussian elimination and lu decomposition、 gram-schmidt orthonormalization、 least squares problems、 eigenvalue problems、 singular value decomposition as well as basic iterative methods. furthermore, it describes how to implement related algorithms.
mimz03 open source tool for data science (3 credits)
this course mainly introduces the basic syntax and control structure of python language, and then introduce the commonly used modules in data analysis such as numpy, pandas, mathplotlib, sqlite3, sklearn etc. finally, it introduces common data analysis operations, such as crawling network data, regular expressions, storing and accessing data, regression and classification, cluster analysis, principal component analysis, time series analysis and prediction. additionally, this cource will also introduce the use of other open source tools, including sql, shell, julia, opencv, etc.
mimz04 applied statistics (3 credits)
this course provides fundamentals of probability and statistics for data analysis in application and research. topics include data collection, exploratory data analysis, random variables, common discrete and continuous distributions, sampling distributions, estimation, confidence intervals, hypothesis tests, regression model, analysis of variance, and multivariate statistical analysis and bayesian statistics et.
mimz05 data mining (3 credits)
this course introduces the latest data mining technology and its application. the object of the course is to help students understand the principles and the importance of data mining technology and mainly focus on the technical developments of data mining and its related subject such as artificial intelligence and machine learning. topics of this course include the concepts and techniques of data science, such as statistical descriptions of data, data visualization, data preprocessing, data warehousing, frequent pattern mining and association rule analysis, classification and supervised learning, clustering and unsupervised learning, variable selection. to realize related algorithms by python are also required.
mimz06 machine learning (3 credits)
this course will cover a wide range of concepts and techniques such as machine learning, data mining and statistical pattern recognition. more specifically, topics will include: (1) supervised learning (e.g. parametric/nonparametric algorithms, support vector machines, kernel methods and neural networks), (2) unsupervised learning (e.g. clustering, dimension reduction and recommendation systems) and (3) advanced topics in machine learning.
mimz07 time series analysis (3 credits)
this course is intended to provide students with an introduction to the basic knowledge and methods of analyzing real data of time series analysis. it introduces time series decomposition, moving average method, exponential moving average method, as well as basic knowledge such as correlation, stationarity. in addition, the course presents traditional time series models, such as bass model, holt-winters exponential smoothing model, linear model, harmonic seasonal model, random walk, moving average process, autoregressive process, autoregressive conditional heteroskedastic model. these models will be used to fit real data to help better understand and use. r language will be used to make graphs and analyze data. these contents are helpful for time series theoretical research and interpretation of real-world data.
elective courses:
mime01 advanced topics in applied mathematics (3 credits)
this course mainly introduces practical topics in applied mathematics, such as numerical methods of inverse problem in mathematical physics. the course covers truncated singular value decomposition, tikhonov regularization method, variation regularization, and statistical inversion(markov chain monte carlo sampling and bayesian inference). additionally, some applications including computed tomography, convolution and image deblurring will also be included.
mime02 advanced topics in data science (3 credits)
this course introduces the latest theories and applications in data science, such as deep learning and its application to computer vision and natural language processing. deep learning is a branch of machine learning concerned with the development and application of modern neural networks. deep learning algorithms extract layered high-level representations of data in a way that maximizes performance on a given task. the course will cover a range of topics from basic neural networks, convolutional and recurrent network structures, deep unsupervised and reinforcement learning, and applications to problem domains like natural language processing and computer vision.
mime03 programming in data science (3 credits)
this course aims to focus on algorithms, models, and frameworks for deep learning and its programming. it specifically deals with deep learning with pytorch, including numpy, pandas, machine learning theory, test/train/validation data split, model evaluation, tensors with pytorch, neural network theory (perceptron, network, activation function, cost/loss function, backpropagation, gradient), artificial/deep neural network (ann/dnn), convolutional neural network (cnn), recurrent neural network (rnn, lstm, gru), nlp with pytorch, using gpu with pytorch, and many more.
mime04 digital image processing (3 credits)
this course will give lectures to introduce the principle, technique and application of digital image processing and pattern recognition, including digital image preprocessing, feature extracting and analysis; statistical pattern recognition and structural pattern recognition and their application in different areas. students will be asked to select some special topics in the prip area based on the contents they have learnt from the course, search and read related papers, and then give a survey report on the topics selected.
mime05 data visualization and analyzation (3 credits)
this course will focus on the visualization techniques commonly used in data processing, including multi-dimension display of data with various feature distributions and popular modules in python such as matplotlib and seaborn.
mime06 data warehouse and data mining (3 credits)
this course will introduce the principle, technique and application of data warehouse and data mining, including data warehousing and on-line analytical processing (olap), data preprocessing techniques (data cleaning, integration, transformation and reduction), data mining techniques (data classification, prediction, correlation and clustering), their application and developing trends.
mime07 stochastic processes (3 credits)
stochastic process is to study time-varying random phenomena. this course will introduce the basic theory and applications of stochastic processes from an engineering perspective, including basic concepts of stochastic processes, possion processes, markov chains, queuing theory.
mime08 multimedia signals and systems (3 credits)
this subject intends to introduce students to the notion of multimedia signals and their processing techniques. there are various methods for representing a multimedia signal, e.g., time domain, frequency domain, time-frequency domain, and eigen-domain. such representations will be used to characterize multimedia signals. moreover, filter designs for multimedia signals will be considered. some adaptive processing techniques, e.g., hidden markov models, random field models, state space models will be considered for modeling multimedia signals.
mime09 database systems (3 credits)
the course aims to provide a foundation in understanding of database design principles, implementation and management. upon completion, students should be able to identify and execute the steps involved in the design of a database, implement the design via a relational database management system, maintain the goal of data sharing and consistency of database systems.