ASDK20006U Advanced Social Data Science II
Mandatory course on MSc programme in Social Data Science at University of Copenhagen. The course is only open for students enrolled in the MSc programme in Social Data Science.
The wealth of new data in the digital society is characterized
by high frequency observations in a high granularity setting,
allowing for both comprehensive and detailed analysis of social and
individual behaviour. Messages in digital form and comments and
conversations on social media have the potential to provide thick
descriptions of social interactions and individual values in
large-scale, sometimes population level, and settings.
At the same time, digitalization of large corpuses of legal,
administrative and political texts allow for dynamic analysis of
evolving social ideas and issues.
At the same time, most digital data do not arrive in simple
accessible, quantifiable and comparable forms, but as text, sound
and pictures. Advanced Social Data Science II focuses on
unstructured data and methods for processing, transforming and
dealing with complex and high dimensional data.
The course presents classic unsupervised learning methods for characterizing and developing typologies and categories of individual and social behaviour, networks and ideas. Furthermore, it introduces state-of-the-art methods of self-supervision and transfer learning for classifying complex unstructured data such as text and images, and relates such data-driven methods to existing theoretical methods and models, as well as quantitative and qualitative methods, in the social sciences.
Knowledge
- Explain the differences between and capabilities of neural network architectures such as CNN, RNN, LSTM and attention-based models.
- Account for various learning strategies, algorithms as well as approaches: clustering and unsupervised learning, supervised learning, semi-supervised learning, transfer learning, multi-task learning.
- Account for the potential of different representations, encodings and transformations of text, structured and unstructured.
Skills
- Extract reliable information from text data using supervised learning and techniques from natural language processing.
- Handle advanced matrix and tensor manipulation using a major deep learning framework (e.g. PyTorch, TensorFlow)
- Apply state-of-the-art deep transfer learning to classify unstructured data.
- Master computer vision methods to extract features from image data.
Competencies
- Integrate theoretical and applied knowledge within the field of Social Data Science and formulate compelling research questions given an unstructured dataset.
- Construct data sets for social science research from unstructured text and media data that are validated and well documented.
- Independently carry out an end-to-end analysis given an unstructured dataset of text or images, including exploratory analysis and discovery using unsupervised methods and supervised learning for measurement, and assessment of model-based biases.
- Critically evaluate the implications of results, considering model limitations and biases, and systematic noise introduced by data collection and sampling methods.
- Communicate results using comprehensive statistics and modern visualization methods in particular plotting new data types to specialists within the academic field.
Examples of course readings:
- Bishop, Christopher: *Pattern Recognition and Machine Learning*. Spring Publishing, 2006.
- Cantu, Francisco & Michelle Torres: "Learning to See: Visual Analysis for Social Science Data".
- Gentzkow, M., Kelly, B. T., & Taddy, M. Text as Data. *Journal of Economic Literature*.
- Grimmer, J., & Stew art, B. M. (2013). Text as data: The promise and pitfalls of automatic content analysis methods for political texts. *Political Analysis*, 21(3), 267-297.
- Hastie, T., & Tibshirani, R. & Friedman, J.(2008). *The Elements of Statistical Learning; Data Mining, Inference and Prediction*.
- Jurafsky, Dan, and James H. Martin. *Speech and language processing*. Vol. 3. London: Pearson, 2014.
- Krippendorff, Klaus. Content analysis: An introduction to its methodology. Sage publications, 2018.
Students must follow Advanced Social Data Science II (ASDSII) concurrently with the Digital Methods course.
- Category
- Hours
- Lectures
- 28
- Preparation
- 112
- Exercises
- 42
- Exam
- 24
- Total
- 206
The Study Administration will sign students on the 2nd semester up for this course.
- Credit
- 7,5 ECTS
- Type of assessment
- Written assignmentThe exams for Advanced Social Data Science II and Digital Methods are combined into a single exam (see also section 6.7), with separate assessment and grading.
Group-based written assignment in the shape of a wiki that students write in groups of 3-4 students. The wiki should contain text (method accounts, analyses etc.) and formulate and evaluate an explicit research question. In addition, it may contain documentation in the shape of code, field-notes, data visualizations etc., if relevant in terms of the project.
Specifics about the ASDSII assessment:
The ASDSII assessment is based on the technical aspects of the wiki material. Students must, thus, formulate a separate wiki section in which the operative use of data is described and discussed. E.g. classification of complex unstructured data or characterization or development of typologies and categories. Aspects of the project pertaining to the ASDSII exam must be clearly marked in the wiki as a stand-alone page. - Exam registration requirements
To be eligible for the exam in ASDSII, it is a requirement that students have completed a number of compulsory problem sets based on a social science question combining knowledge of social science research design with methods from the course. The problem sets must be completed in groups and must be approved by the instructor.
- Aid
- All aids allowed
- Marking scale
- 7-point grading scale
- Censorship form
- No external censorship
- Re-exam
The 2. and 3. examination attempts will be based on a new written assignment, approved by the teacher. It will be possible to do the re-examination individually or in groups of 3 or 4.
If a student fails 1 course, the re-examination will only cover the course that the student did not pass.
If a student fails both courses, the re-examination in the respective courses will be conducted separate, i.e. the re-examination are not combined into one.
Criteria for exam assesment
The exam will be assessed on the basis of the learning outcome (knowledge, skills and competencies) for the course.
Course information
- Language
- English
- Course code
- ASDK20006U
- Credit
- 7,5 ECTS
- Level
- Full Degree Master
- Duration
- 1 block
- Placement
- Spring And Block 4
- Course capacity
- 70 students.
- Study board
- Social Data Science
Contracting department
- Social Data Science
Contracting faculty
- Faculty of Social Sciences
Course Coordinators
- Frederik Georg Hjorth (fh@ifs.ku.dk)