Programme JFPDA

Les exposés invités durent une heure en tout, questions comprises. Les exposés des papiers durent une demi-heure en tout, questions comprises.

Session pléniaire 30 juin 9h-10h

Président : Bruno Zanuttini

Ann Nowé. Aspects of transparency in Reinforcement Learning.

[https://pfia2021.fr/invites/?p=nowe]

Session mercredi 30 juin 10h30-12h30

Président : Bruno Zanuttini

Accueil
Ahmed Akakzia, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani and Olivier Sigaud. Grounding Language to Autonomously-Acquired Skills via Goal Generation
Aurelien Delage, Olivier Buffet and Jilles Dibangoye. HSVI for zs-POSGs using Convexity, Concavity and Lipschitz Properties
Arthur Queffelec, Ocan Sankur and Francois Schwarzentruber. Planning for Connected Agents in a Partially Known Environment

Session mercredi 30 juin 15h30-17h30

Président : François Schwarzentruber

Exposé invité. Thomas Bolander (DTU, Denmark)
Sergej Scheck, Alexandre Niveau and Bruno Zanuttini. Explicit Representations of Persistency for Propositional Action Theories
Sébastien Gamblin, Alexandre Niveau and Maroua Bouzid. Vérification symbolique de modèles pour la logique épistémique dynamique probabiliste

Session jeudi 1er juillet 10h30-12h30

Président : François Schwarzentruber

Exposé invité. Odalric-Ambrym Maillard (INRIA, Lille)
Yang You, Vincent Thomas, Francis Colas and Olivier Buffet. Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP
Table ronde
Clotûre

Liste des papiers acceptés

Ahmed Akakzia, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani and Olivier Sigaud. Grounding Language to Autonomously-Acquired Skills via Goal Generation
Aurelien Delage, Olivier Buffet and Jilles Dibangoye. HSVI for zs-POSGs using Convexity, Concavity and Lipschitz Properties
Arthur Queffelec, Ocan Sankur and Francois Schwarzentruber. Planning for Connected Agents in a Partially Known Environment
Yang You, Vincent Thomas, Francis Colas and Olivier Buffet. Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP
Sergej Scheck, Alexandre Niveau and Bruno Zanuttini. Explicit Representations of Persistency for Propositional Action Theories
Sébastien Gamblin, Alexandre Niveau and Maroua Bouzid. Vérification symbolique de modèles pour la logique épistémique dynamique probabiliste

Exposés invités

Odalric-Ambrym Maillard (INRIA, Lille)

Titre: Apprentissage par renforcement, régions de confiance et agroécologie.

Résumé: Dans cet exposé, je rappellerai dans un premier temps les grands principes de l'apprentissage par renforcement, dans le cadre des processus décisionnels de Markov et des bandits manchots. Dans les approches orientées modèle, je montrerai comment la construction d'intervalles et régions de confiance fins pour prendre en compte l'incertitude peut améliorer les performances et analyses des algorithmes de l'état de l'art, parfois significativement. En pratique, les systèmes dynamiques considérés font apparaître diverses notions de structures (équivalences, régularité Lipschitz, linéarité, etc.). La troisième partie de l'exposée s'intéressera à la prise en compte de ces structures dans la construction de stratégies d'apprentissage optimales. Pour illustration, nous nous placerons dans le cadre de la construction d'un compagnon de décision séquentielle en agroécologie. Ce cadre ouvre de nombreuses questions de recherche que nous esquisserons au cours de l'exposé.

Thomas Bolander (DTU, Danemark)

Title: Theory of Mind and Epistemic Planning for Human-Robot Collaboration

Abstract: Epistemic planning is a branch of automated planning where planning agents explicitly reason about the mental states of other agents: their beliefs, knowledge, plans and goals. Essentially the idea is to give the planning agents a Theory of Mind to be able to take the perspective of other agents, hence improving collaboration and coordination. The talk will give a brief introduction to epistemic planning based on dynamic epistemic logic (DEL), and will then mainly focus on how to implement DEL-based planning systems on physical robots and apply these implementations to human-robot collaboration scenarios. One of the recurring themes is implicit coordination: how to successfully achieve joint goals in decentralised multi-agent systems without prior negotiation or coordination, but exclusively through the ability to take the perspective of the other agents.

JFPDA

Journées Francophones Planification, Décision et Apprentissage

(Evènement affilié à PFIA 2021)
30 Juin - 1er Juillet 2021, Bordeaux

Comment participer (en virtuel) à la conférence ?

Programme JFPDA

Session pléniaire 30 juin 9h-10h

Session mercredi 30 juin 10h30-12h30

Session mercredi 30 juin 15h30-17h30

Session jeudi 1er juillet 10h30-12h30

Liste des papiers acceptés

Exposés invités

Odalric-Ambrym Maillard (INRIA, Lille)

Thomas Bolander (DTU, Danemark)

JFPDA

Journées Francophones Planification, Décision et Apprentissage

(Evènement affilié à PFIA 2021) 30 Juin - 1er Juillet 2021, Bordeaux

Comment participer (en virtuel) à la conférence ?

Programme JFPDA

Session pléniaire 30 juin 9h-10h

Session mercredi 30 juin 10h30-12h30

Session mercredi 30 juin 15h30-17h30

Session jeudi 1er juillet 10h30-12h30

Liste des papiers acceptés

Exposés invités

Odalric-Ambrym Maillard (INRIA, Lille)

Thomas Bolander (DTU, Danemark)

(Evènement affilié à PFIA 2021)
30 Juin - 1er Juillet 2021, Bordeaux