
Data challenges

2017-2018 (updated Nov. 9: participants and team composition)

The challenge will be on audio-visual speaker diarization.

Registered students:

Schedule:
  • (October 12) Presentation of the context by Radu Horaud at the Data science seminar.
    Slides of the introductory presentation by Radu Horaud
    Before the seminar, you have to read the two mentioned articles.
  • (around October 20) Delivering the training data set
  • (around November 9) One session to get familiar with the training data set, and some software for video exploration
  • (December 11, room Ensimag H101) Development of a basic solution (baseline). Short oral presentation of the baseline, with a short written description (1-2 page report). You may state any particular needs in terms of computational power for the intensive session.
  • (February 7-9, room IM2AG F108, from 9 AM to 5 PM) Intensive session, partly supervised by tutors. February 9 AM: prediction on the evaluation data set. February 9 PM: defence of the whole project and final proposal, with questions from teachers and members of the other teams. Report of 4 to 10 pages with experience feedback (description and comparison of the different approaches that were tried).

Data and basic scripts

The data can be downloaded from https://github.com/Stephlat/dataChallengePerception. This page also contains details about the data and basic scripts for data visualization.

Presentation and report

The presentation and report should describe the different approaches you tried, even those that were discarded because of unsatisfactory results. The report has to be exhaustive on that point, while the presentation may focus on 2 or 3 of them. Report the metric values and execution times of each approach, and compare them. Try to analyse the shortcomings of the methods, explain how you gradually overcame them, and state what could be further improved in your final proposal.
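
As an illustration of how metric values and execution times could be collected side by side, here is a minimal Python sketch. Everything in it is an assumption made for the example: the helper names (`frame_error_rate`, `compare_approaches`), the toy data, the two toy approaches, and the frame-level error rate itself, which is only a stand-in for whatever metric the challenge actually uses. It also assumes that predicted speaker labels are already mapped to the reference labels.

```python
import time


def frame_error_rate(predicted, reference):
    """Fraction of frames whose predicted label differs from the reference.

    Placeholder metric for illustration only; assumes predicted labels
    are already mapped to the reference speaker labels.
    """
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        return 0.0
    errors = sum(p != r for p, r in zip(predicted, reference))
    return errors / len(reference)


def compare_approaches(approaches, frames, reference):
    """Run each approach once; return (name, error rate, seconds) rows."""
    rows = []
    for name, predict in approaches.items():
        start = time.perf_counter()
        predicted = predict(frames)
        elapsed = time.perf_counter() - start
        rows.append((name, frame_error_rate(predicted, reference), elapsed))
    return rows


# Toy data (hypothetical): one score per frame and one reference label per frame.
frames = [0.1, 0.9, 0.8, 0.2, 0.7]
reference = ["A", "B", "B", "A", "B"]

# Two hypothetical approaches, each mapping frames to one label per frame.
approaches = {
    "always_A": lambda xs: ["A"] * len(xs),
    "threshold_0.5": lambda xs: ["B" if x > 0.5 else "A" for x in xs],
}

results = compare_approaches(approaches, frames, reference)
for name, err, sec in results:
    print(f"{name:15s} error={err:.2f} time={sec:.6f}s")
```

Keeping every approach behind the same call signature makes it easy to add discarded attempts to the same table, which is what the report is expected to cover.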

The oral presentation should last 18 to 20 minutes and will be followed by questions. Presentations will start at 1:30 PM (or maybe 2 PM, depending on participants' constraints). Do not spend more than one slide / 1 min 30 on the problem description and data, since these are common to every team.

The report should be from 4 to 10 pages long. You have to deliver it just before the presentations.

Please have a close-to-final version of both the report and the slides ready on Thursday evening.

collab/data_challenge.1517839947.txt.gz · Last modified: 2018/02/05 15:12 by jbdurand
