[DL] Two Ph.D positions in semantic multimedia processing at FBK

Luciano Serafini serafini at fbk.eu
Thu May 7 12:45:13 CEST 2015

[Please circulate to all the intersted people / your contacts]

			   2 PhD positions
  			 Cognitive Computing
		  Center for Information Technology

			  Duration: 3 years
		       Close Date: May 27, 2015

			Formal application at:

These position will be part of the Joint Project "Understanding multimedia content (UMC)" of the Cognitive Computing research line (http://ict.fbk.eu/cognitive-computing). The UMC project is developed in collaboration between the Natural Language Processing research unit (hlt.fbk.eu) the Technologies for Vision research init (tev.fbk.eu) and the Data and Knowledge Management research unit (dkm.fbk.eu).


This phd has the objective of extracting events from commented videos exploiting background knowledge available in the semantic web. This phd should develop a holistic approach, where the process of extracting information from the video, and from the associated text are integrated and can affect each other at any stage. This implies that video stream and textual stream are considered as a whole information space and their interpretations are not independent. Furthermore, video-text interpretation should not happen in the knowledge vacuum, but it should exploit the existing large amount of background knowledge available in the semantic web under the form of ontologies and RDF data. Nowadays--in contrast with the early years of AI when knowledge acquisition was a bottleneck--large amount of commonsense knowledge is available in the semantic web, but it cannot be easily exploited by the state-of-the-art approaches to video and text analisys. The thesis should investigate on how to extend and adapt algorithms for video and text analysis in order to inject background knowledge. The thesis, to reach it's objective, should combine techniques in machine learning--for processing low level data--with automated reasoning--to manage with high level semantic knowledge.


Multimedia content analysis more and more relies on advanced machine learning to capture the enormous richness of multi-modal sources (commented videos, images with captions, etc.). At the other side, domain specific knowledge is often available to leverage the content analysis task, but effectively encoding it into machine learning (down to the development of task-specific feature representations) is still an open research issue. The goal of this PhD is to progress on the computer vision side of the problem, to go beyond a mono-modal approach where supervisions for learning are provided explicitly. Instead, we will investigate how structured (background knowledge) and semi-structured data (e.g. text captions and descriptions) can be used to provide implicit supervision to enrich the task-specific visual learning capabilities.

SKILLS: Candidates are required to have basic or advanced skills in one or more of the following areas: artificial intelligence, machine learning, computer vision, natural language processing and knowledge representation techniques.

For further infos please contact:

Dr. Oswald Lanz 
TEV, Fondazione Bruno Kessler
Email: lanz at fbk.eu
Web: tev.fbk.eu/people/profile/lanz

Dr. Bernardo Magnini
HLT, Fondazione Bruno Kessler
Email: magnini at fbk.eu
Web: hlt.fbk.eu/people/profile/magnini

Dr. Luciano Serafini 
DKM, Fondazione Bruno Kessler
Email: serafini at fbk.eu
Web: dkm.fbk.eu/people/profile/serafini

More information about the dl mailing list