Data Mining & Knowledge Discovery

(Mineração de Dados e Descoberta de Conhecimento)

DAT003/CAIA003 - CPGEI & PPGCA

     

last update: 21/06/2021 16:10


General information:


Tentative schedule (subject to change):

Week | Date | Subject | Lecture notes | Software/data | Support videos
1 | June 21st | Introduction: the data mining & knowledge discovery process. Presentation of real-world case studies | class1a, class1b | | O que é data-mining (2'53")
2 | June 28th | Types of data and their analysis. Data warehousing. Data collection (web crawling & web scraping), dataset construction and data visualization | class2a, class2b | software Orange | DatasetCreation (7'41"), WebscrappingXWebcrawling (3'13"), PowerBI (60'05"), Orange básico (12'44")
3 | July 5th | Classification task: decision trees. Models, concepts and evaluation metrics | | software Weka |
4 | July 12th | Classification task: decision rules. Bagging and boosting | | |
5 | July 19th | Association analysis task: frequent and infrequent pattern discovery | | |
6 | July 26th | Clustering task: K-means, hierarchical clustering, cluster quality | | |
7 | August 2nd | Feature selection, dimensionality reduction, Principal Component Analysis (PCA) | | |
8 | August 9th | Multimedia mining | | |
9 | August 16th | Text mining | | |
10 | August 23rd | | | |
11 | August 30th | PROJECT PROPOSAL DUE: Short presentation and discussion of proposals for the final project, including objective, dataset construction, methods, and analysis. Proposals will be reviewed, and students will be notified by e-mail whether the proposal is approved or must be resubmitted | | |
 | September 20th | PROJECT REPORT DUE: Full paper-style report, along with code and data | | |
 | September 27th, September 28th | ORAL PRESENTATION: Live seminar presenting the results of the final project | | |


Homework:

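HW# | Subject | Date due | Link | Datasets | Upload link
1 | | | | |
2 | | | | |
3 | | | | |
4 | | | | |
5 | | | | |
6 | | | | |
7 | | | | |
8 | | | | |
9 | | | | |
10 | | | | |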


Support materials and links: