MALIS Publications
 

- 2017 -

Conference Proceedings :

D. SINGH, E. MERDIVAN, I. PSYCHOULA, S. HANKE, J. KROPF, M. GEIST, A. HOLZINGER, "Human Activity Recognition using Recurrent Neural Networks". In Cross Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), 2017. BibTex
A. MANUKYAN, M.A. OLIVARES-MENDEZ, H. VOOS, M. GEIST, "Real time degradation identification of UAV using machine learning techniques". In International Conference on Unmanned Aircraft Systems (ICUAS'17), 2017. BibTex

Workshops :

M. GEIST, B. PIOT, O. PIETQUIN, "Faut-il minimiser le résidu de Bellman ou maximiser la valeur moyenne ?". In Journées Francophones sur la Planification, la Décision et l'Apprentissage pour la conduite de systèmes (JFPDA), 2017. BibTex

- 2016 -

Regular Papers :

B. PIOT, M. GEIST, O. PIETQUIN, "Bridging the Gap between Imitation Learning and Inverse Reinforcement Learning". In IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2016. BibTex

Conference Proceedings :

B. PIOT, M. GEIST, O. PIETQUIN, "Batch Policy Iteration Algorithms for Continuous Domains". In European Workshop on Reinforcement Learning (EWRL), 2016. BibTex
J. PÉROLAT, B. PIOT, M. GEIST, B. SCHERRER, O. PIETQUIN, "Softened Approximate Policy Iteration for Markov Games". In International Conference on Machine Learning (ICML), 2016. BibTex
L. EL ASRI, B. PIOT, M. GEIST, R. LAROCHE, O. PIETQUIN, "Score-based Inverse Reinforcement Learning". In International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2016), 2016. BibTex

Thesis :

M. GEIST, "Contrôle optimal et apprentissage automatique, applications aux interactions homme-machine", Habilitation à Diriger des Recherches, PhD Thesis, Université de Lille 1, 2016. BibTex

Miscellaneous :

M. GEIST, B. PIOT, O. PIETQUIN, "Should one minimize the expected Bellman residual or maximize the mean value?", arxiv, 2016. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Difference of Convex Functions Programming Applied to Control with Expert Data", arxiv, 2016. BibTex

- 2015 -

Regular Papers :

B. SCHERRER, M. GEIST, "Recherche locale de politique dans un espace convexe". In Revue d'Intelligence Artificielle (RIA), 29(6):685-704, 2015. BibTex
M. GEIST, "Soft-max boosting". In Machine Learning, 100(2):305-332, (I discovered after publication that a very similar approach had been published earlier; see "An iterative method for multi-class cost-sensitive learning" by Abe, Zadrozny and Langford, KDD'04), 2015. BibTex
B. SCHERRER, M. GHAVAMZADEH, V. GABILLON, B. LESNER, M. GEIST, "Approximate Modified Policy Iteration and its Application to the Game of Tetris". In Journal of Machine Learning Research, 16:1629-1676, 2015. BibTex

Conference Proceedings :

M. LAUFFER, F. GENTY, S. MARGUERON, J.L. COLLETTE, J.C. PIHAN, "Automatic recognition system of aquatic organisms by classical and fluorescence microscopy". In Proceedings of SPIE 9506, 9506(1O):1-7, Prague (Czech Republic), 2015. BibTex
S. ROSSIGNOL, "Technique d'assistance par le bruit pour aider l'opérateur de Teager-Kaiser à suivre une composante fréquentielle perturbée par du bruit". In GRETSI, 2015. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Imitation Learning Applied to Embodied Conversational Agents". In Machine Learning and Interactive Systems (MLIS), 2015. BibTex
M. GEIST, "A multiplicative UCB strategy for Gamma rewards". In European Workshop on Reinforcement Learning (EWRL), 2015. BibTex
T. MUNZER, B. PIOT, M. GEIST, O. PIETQUIN, M. LOPES, "Inverse Reinforcement Learning in Relational Domains". In International Joint Conferences on Artificial Intelligence (IJCAI), (to appear), 2015. BibTex

- 2014 -

Regular Papers :

H. FREZZA-BUET, "Online Computing of Non-Stationary Distributions Velocity Fields by an Accuracy Controlled Growing Neural Gas". In Neural Networks, 60:203-221, 2014. BibTex
M. GEIST, B. SCHERRER, "Off-policy Learning with Eligibility Traces: A Survey". In Journal of Machine Learning Research (JMLR), 15:289-333, 2014. BibTex
N. FRESSENGEAS, H. FREZZA-BUET, "Cellular Computing and Least Squares for partial differential problems parallel solving". In Journal of Cellular Automata, 9(1):1-21, 2014. BibTex

Conference Proceedings :

M. LAUFFER, F. GENTY, S. MARGUERON, J.L. COLLETTE, A. LEVEY, Y. HOUZELLE, J.C. PIHAN, "Fluorescence imaging of plant pigments for automatical recognition of aquatic organisms". In Proceedings of the XII Conference on Optical Chemical Sensors and Biosensors:FS-29, Athens (Greece), 2014. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Difference of Convex Functions Programming for Reinforcement Learning". In Advances in Neural Information Processing Systems (NIPS 2014), 2014. BibTex
B. PIOT, O. PIETQUIN, M. GEIST, "Predicting when to laugh with structured classification". In Annual Conference of the International Speech Communication Association (InterSpeech), 2014. BibTex
B. SCHERRER, M. GEIST, "Local Policy Search in a Convex Space and Conservative Policy Iteration as Boosted Policy Search". In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), (to appear), 2014. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Boosted Bellman Residual Minimization Handling Expert Demonstrations". In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), Springer, (to appear), 2014. BibTex
J. FIX, "Dynamic formation of self-organizing maps". In 10th workshop on Self-Organizing Maps, WSOM 2014, 2014. BibTex
D. BAHEUX, H. FREZZA-BUET, J. FIX, "Towards an effective multi-map self organizing recurrent neuronal network". In Proc. ESANN, 2014. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Boosted and Reward-regularized Classification for Apprenticeship Learning". In 13th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2014), Paris, France, (accepted, to appear), 2014. BibTex

Thesis :

B. KHOUZAM, "Neural Networks as Cellular Computing Models for Temporal Sequence Processing", PhD Thesis, Supélec, 2014. BibTex

Workshops :

B. PIOT, M. GEIST, O. PIETQUIN, "Méthode de minimisation du résidu de Bellman boostée qui tient compte des démonstrations expertes". In Journées Francophones de Planification, Décision et Apprentissage (JFPDA), 2014. BibTex
B. SCHERRER, M. GEIST, "Quand l'optimalité locale implique une garantie globale : recherche locale de politique dans un espace convexe et algorithme d'itération sur les politiques conservatif vu comme une montée de gradient". In Journées Francophones de Planification, Décision et Apprentissage (JFPDA), 2014. BibTex

- 2013 -

Regular Papers :

J. FIX, "Template based black-box optimization of dynamic neural fields". In Neural Networks, 46:40-49, 2013. BibTex
H. FREZZA-BUET, M. GEIST, "A C++ Template-Based Reinforcement Learning Library: Fitting the Code to the Mathematics". In Journal of Machine Learning Research, 14:625-628, 2013. BibTex
M. GEIST, O. PIETQUIN, "Algorithmic Survey of Parametric Value Function Approximation". In IEEE Transactions on Neural Networks and Learning Systems, 24(6):845-867, pdf, 2013. BibTex
E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "Classification structurée pour l'apprentissage par renforcement inverse". In Revue d'Intelligence Artificielle, 27(2/2013):155-170, pdf, 2013. BibTex
B. KHOUZAM, H. FREZZA-BUET, "Distributed Recurrent Self-Organization for Tracking the State of Non-Stationary Partially Observable Dynamical Systems". In Biologically Inspired Cognitive Architectures, 3:87-104, 2013. BibTex
O. PIETQUIN, H. HASTIE, "A survey on metrics for the evaluation of user simulations". In Knowledge Engineering Review, 28(01):59-73, first published as FirstView, 2013. BibTex

Conference Proceedings :

L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Model-free POMDP optimisation of tutoring systems with echo-state networks". In Proceedings of the 14th SIGDial Meeting on Discourse and Dialogue (SIGDial 2013):102-106, Metz (France), 2013. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Learning from demonstrations: Is it worth estimating a reward function?". In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2013), Blockeel, Hendrik and Kersting, Kristian and Nijssen, Siegfried and Zelezny, Filip, ISBN: 978-3-642-40987-5, Springer, Lecture Notes in Computer Science, 8188:17-32, Prague (Czech Republic), 2013. BibTex
E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "A cascaded supervised learning approach to inverse reinforcement learning". In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2013), Blockeel, Hendrik and Kersting, Kristian and Nijssen, Siegfried and Zelezny, Filip, ISBN: 978-3-642-40987-5, Springer, Lecture Notes in Computer Science, 8188:1-16, Prague (Czech Republic), 2013. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Particle Swarm Optimisation of Spoken Dialogue System Strategies". In Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon (France), 2013. BibTex
L. EL ASRI, R. LAROCHE, O. PIETQUIN, "Reward Shaping For Statistical Optimisation Of Dialogue Management". In Proceedings of the International Conference on Statistical Language and Speech Processing (SLSP 2013), Dediu, Adrian-Horia and Martin-Vide, Carlos and Mitkov, Ruslan and Truthe, Bianca, ISBN: 978-3-642-39592-5, Springer, Lecture Notes in Computer Science, 7978:93-101, Tarragona (Spain), 2013. BibTex
O. PIETQUIN, "Inverse Reinforcement Learning for Interactive Systems". In Proceedings of the IJCAI workshop on Machine Learning for Interactive Systems (MLIS 2013), Beijing (China), Invited Speaker, 2013. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Random Projections: a Remedy for Overfitting Issues in Time Series Prediction with Echo State Networks". In Proceedings of the 38th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), Vancouver, Canada, to appear, 2013. BibTex
R. NIEWIADOMSKI, J. HOFMANN, J. URBAIN, T. PLATT, J. WAGNER, B. PIOT, H. CAKMAK, S. PAMMI, T. BAUR, S. DUPONT, M. GEIST, F. LINGENFELSER, G. MCKEOWN, O. PIETQUIN, W. RUCH, "Laugh-aware virtual agent and its impact on user amusement". In Proceedings of the Twelfth International Conference on Autonomous Agents and Multiagent Systems (AAMAS2013):619-626, Saint Paul, USA, 2013. BibTex

Proceedings :

M. ESKENAZI, M. STRUBE, B. DI EUGENIO, J. WILLIAMS, O. PIETQUIN, "14th annual SIGdial Meeting on Discourse and Dialogue", 2013. BibTex

Technical Reports :

M. GEIST, B. SCHERRER, "Off-policy Learning with Eligibility Traces: A Survey", Supélec - INRIA, 2013. BibTex

Patents :

J. OSTER, G. CLIFFORD, O. PIETQUIN, M. GEIST, "Periodic Artifact Reduction from Biomedical Signals", patent WO2013052944, 2013. BibTex

Workshops :

B. PIOT, M. GEIST, O. PIETQUIN, "Classification régularisée par la récompense pour l'Apprentissage par Imitation". In Journées Francophones de Planification, Décision et Apprentissage (JFPDA), Lille (France), 2013. BibTex
M. GEIST, E. KLEIN, B. PIOT, Y. GUERMEUR, O. PIETQUIN, "Around Inverse Reinforcement Learning and Score-based Classification". In 1st Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Princeton (USA), 2013. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Learning from demonstrations: Is it worth estimating a reward function?". In 1st Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Princeton, USA, 2013. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Optimisation par essaims particulaires de stratégies de dialogue". In Journées Francophones de Planification, Décision et Apprentissage (JFPDA), 2013. BibTex
E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "Apprentissage par renforcement inverse en cascadant classification et régression". In Journées Francophones de Planification, Décision et Apprentissage (JFPDA), 2013. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Apprentissage par démonstrations : vaut-il la peine d'estimer une fonction de récompense?". In Journées Francophones de Planification, Décision et Apprentissage (JFPDA), 2013. BibTex

- 2012 -

Regular Papers :

J. WILLIAMS, K. YU, B. CHAIB DRAA, O. LEMON, R. PIERACCINI, O. PIETQUIN, P. POUPART, S. YOUNG, "Introduction to the Issue on Advances in Spoken Dialogue Systems and Mobile Interface". In IEEE Journal of Selected Topics in Signal Processing, 6(8):889-890, 2012. BibTex
L. DAUBIGNEY, M. GEIST, S. CHANDRAMOHAN, O. PIETQUIN, "A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimisation". In IEEE Journal of Selected Topics in Signal Processing, ISSN: 1932-4553, 6(8):891-902, pdf, 2012. BibTex
J. FIX, N. ROUGIER, "DANA: Distributed numerical and adaptive modelling framework". In Network: Computation in Neural Systems:1-17, 2012. BibTex

Conference Proceedings :

F. PENNERATH, "L'extraction de règles de dépendance bien définies entre ensembles de variables multivaluées". In Extraction et gestion des connaissances (EGC'2012), Actes, janvier 31 - février 2012, Bordeaux, France, Yves Lechevallier and Guy Melançon and Bruno Pinaud, ISBN: 978-2-70568-310-8, Revue des Nouvelles Technologies de l'Information, RNTI-E-23:249-254, Hermann-Editions, 2012. BibTex
S. CHANDRAMOHAN, M. GEIST, F. LEFÈVRE, O. PIETQUIN, "Co-adaptation in Spoken Dialogue Systems". In Proceedings of the Fourth International Workshop on Spoken Dialog Systems (IWSDS 2012), Paris (France), 2012. BibTex
E. KLEIN, M. GEIST, B. PIOT, O. PIETQUIN, "Inverse Reinforcement Learning through Structured Classification". In Advances in Neural Information Processing Systems (NIPS 2012), Lake Tahoe (NV, USA), 2012. BibTex
S. CHANDRAMOHAN, M. GEIST, F. LEFÈVRE, O. PIETQUIN, "Behavior Specific User Simulation in Spoken Dialogue Systems". In Proceedings of the 10th ITG Conference on Speech Communication:1-4, Braunschweig (Germany), http://www.metz.supelec.fr/~geist_mat/pdfs/Supelec792.pdf, 2012. BibTex
L. EL ASRI, R. LAROCHE, O. PIETQUIN, "Reward Function Learning for Dialogue Management". In Proceedings of the sixth Starting Artificial Intelligence Research Symposium (STAIRS 2012):95-106, Montpellier (France), 2012. BibTex
O. PIETQUIN, F. TANGO, "A Reinforcement Learning Approach to Optimize the longitudinal Behavior of a Partial Autonomous Driving Assistance System". In Proceedings of the European Conference on Artificial Intelligence (ECAI 2012) and the seventh Conference on Prestigious Applications of Intelligent Systems (PAIS 2012):987-992, Montpellier (France), Best Paper Award, 2012. BibTex
B. SCHERRER, V. GABILLON, M. GHAVAMZADEH, M. GEIST, "Approximate Modified Policy Iteration". In International Conference on Machine Learning (ICML), (to appear), 2012. BibTex
M. GEIST, B. SCHERRER, A. LAZARIC, M. GHAVAMZADEH, "A Dantzig Selector Approach to Temporal Difference Learning". In International Conference on Machine Learning (ICML), (to appear), 2012. BibTex
O. PIETQUIN, "Statistical User Simulation for Spoken Dialogue Systems: What for, Which Data, Which Future?". In Proceedings of the NAACL-HLT 2012 Workshop on Future Directions and Needs in the Spoken Dialog Community: Tools and Data (SDCTD 2012):9-10, Montréal (Canada), Invited position paper, 2012. BibTex
J. OSTER, M. GEIST, O. PIETQUIN, G. CLIFFORD, "Filtering of pathological ventricular rhythms during MRI scanning". In Proceedings of the seventh International Workshop on Biosignal Interpretation, Como (Italy), 2012. BibTex
E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "Classification structurée pour l'apprentissage par renforcement inverse". In Actes de la Conférence Francophone sur l'Apprentissage Automatique (Cap 2012):1-16, Nancy, France, 2012. BibTex
J. FIX, M. GEIST, "Optimisation de contrôleurs par essaim de particules". In Actes de la Conférence Francophone sur l'Apprentissage Automatique (Cap 2012):1-14, Nancy, France, 2012. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Optimisation d'un tuteur intelligent à partir d'un jeu de données fixé". In Actes des Journées d'Etudes de la Parole (JEP 2012):241-248, Grenoble (France), 2012. BibTex
S. CHANDRAMOHAN, M. GEIST, F. LEFÈVRE, O. PIETQUIN, "Clustering Behaviors Of Spoken Dialogue Systems Users". In Proceedings of the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), IEEE:4981-4984, Kyoto (Japan), 2012. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Off-policy Learning in Large-scale POMDP-based Dialogue Systems". In Proceedings of the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), IEEE:4989-4992, Kyoto (Japan), 2012. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Apprentissage off-policy appliqué à un système de dialogue basé sur les PDMPO". In Actes du 18ème congrès francophone sur la Reconnaissance de Formes et l'Intelligence Artificielle (RFIA 2012), Lyon (France), 2012. BibTex

Book Chapters :

S. KEIZER, S. ROSSIGNOL, S. CHANDRAMOHAN, O. PIETQUIN, "User Simulation in the Development of Statistical Spoken Dialogue Systems". In Data-Driven Methods for Adaptive Spoken Dialogue Systems: Computational Learning for Conversational Interfaces, Oliver Lemon - Olivier Pietquin, ISBN: 978-1461448020, Springer, 4:39-73, 2012. BibTex
J. FIX, M. GEIST, "Monte-Carlo Swarm Policy Search". In Symposium on Swarm Intelligence and Differential Evolution, Springer Verlag - Heidelberg Berlin, Lecture Notes in Artificial Intelligence (LNAI):9 pages, Zakopane (Poland), 2012. BibTex
J. GUSTEDT, S. VIALLE, H. FREZZA-BUET, D. BOUMBA SITOU, N. FRESSENGEAS, J. FIX, "InterCell: a Software Suite for Rapid Prototyping and Parallel Execution of Fine Grained Applications". In Applied Parallel and Scientific Computing, 10th International Conference, PARA 2010, Proceedings, Part I, Kristján Jónasson (Ed), Springer, Heidelberg, LNCS, 7133, Extended and selected paper of PARA 2010 conference, 11 pages, 2012. BibTex

Proceedings :

O. PIETQUIN, J. FIX, M. GEIST, "Proceedings of the 8th eNTERFACE workshop", Metz (France), 2012. BibTex

Books :

O. LEMON, O. PIETQUIN, "Data-Driven Methods for Adaptive Spoken Dialogue Systems: Computational Learning for Conversational Interfaces", Oliver Lemon - Olivier Pietquin, ISBN: 978-1461448020, Springer, 177 pages, 2012. BibTex

Thesis :

S. CHANDRAMOHAN, "Revisiting User Simulation in Dialogue Systems: Do we still need them? Will imitation play the role of simulation?", PhD Thesis, University of Avignon, PhD Thesis in Computer Science, 2012. BibTex

Miscellaneous :

K. YU, J. WILLIAMS, B. CHAIB DRAA, O. LEMON, R. PIERACCINI, O. PIETQUIN, P. POUPART, S. YOUNG, "IEEE Journal on Selected Topics in Signal Processing : special issue on Advances in Spoken Dialogue Systems and Mobile Interfaces", 2012. BibTex

Workshops :

E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "Structured Classification for Inverse Reinforcement Learning". In European Workshop on Reinforcement Learning (EWRL 2012), Edinburgh (UK), 2012. BibTex
M. GEIST, B. SCHERRER, A. LAZARIC, M. GHAVAMZADEH, "Un sélecteur de Dantzig pour l'apprentissage par différences temporelles". In Journées Francophones sur la Planification, la Décision et l'Apprentissage pour la conduite des systèmes (JFPDA), 2012. BibTex
B. SCHERRER, V. GABILLON, M. GHAVAMZADEH, M. GEIST, "Approximations de l'algorithme Itérations sur les Politiques Modifié". In Journées Francophones sur la Planification, la Décision et l'Apprentissage pour la conduite des systèmes (JFPDA), 2012. BibTex
S. CHANDRAMOHAN, M. GEIST, F. LEFÈVRE, O. PIETQUIN, "Regroupement non-supervisé d'utilisateurs par leur comportement pour les systèmes de dialogue parlé". In Journées Francophones de Planification, Décision et Apprentissage pour la conduite de systèmes (JFPDA 2012), Nancy (France), 2012. BibTex