> MALIS Home > MALIS Publications
MALIS Publications
 
MALIS Home
 
 

MALIS's publications
1 - 2 - [5] - 10 - 15
last years

- 2017 -

Conference Proceedings :

D.. SINGH, E. MERDIVAN, I.. PSYCHOULA, S.. HANKE, J.. KROPF, M. GEIST, A. HOLZINGER, "Human Activity Recognition using Recurrent Neural Networks". In Cross Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), 2017. BibTex
A. MANUKYAN, M.A.. OLIVARES-MENDEZ, H. VOOS, M. GEIST, "Real time degradation identification of UAV using machine learning techniques". In International Conference on Unmanned Aircraft Systems (ICUAS'17), 2017. BibTex

Workshops :

M. GEIST, B. PIOT, O. PIETQUIN, "Faut-il minimiser le résidu de Bellman ou maximiser la valeur moyenne ?". In Journées Francophones sur la Planification, la Décision et l\'Apprentissage pour la conduite de systèmes (JFPDA), 2017. BibTex

- 2016 -

Regular Papers :

B. PIOT, M. GEIST, O. PIETQUIN, "Bridging the Gap between Imitation Learning and Inverse Reinforcement Learning". In IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2016. BibTex

Conference Proceedings :

B. PIOT, M. GEIST, O. PIETQUIN, "Batch Policy Iteration Algorithms for Continuous Domains". In European Workshop on Reinforcement Learning (EWRL), 2016. BibTex
J. PÉROLAT, B. PIOT, M. GEIST, B. SCHERRER, O. PIETQUIN, "Softened Approximate Policy Iteration for Markov Games". In International Conference on Machine Learning (ICML), 2016. BibTex
L. EL ASRI, B. PIOT, M. GEIST, R. LAROCHE, O. PIETQUIN, "Score-based Inverse Reinforcement Learning". In International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2016), 2016. BibTex

Thesis :

M. GEIST, "Contrle optimal et apprentissage automatique, applications aux interactions homme-machine", Habilitation Diriger des Recherches, PhD Thesis, Universit de Lille 1, 2016. BibTex

Miscellaneous :

M. GEIST, B. PIOT, O. PIETQUIN, "Should one minimize the expected Bellman residual or maximize the mean value?", arxiv, 2016. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Difference of Convex Functions Programming Applied to Control with Expert Data", arxiv, 2016. BibTex

- 2015 -

Regular Papers :

B. SCHERRER, M. GEIST, "Recherche locale de politique dans un espace convexe". In Revue d'Intelligence Artificielle (RIA), 29(6):685-704, 2015. BibTex
M. GEIST, "Soft-max boosting". In Machine Learning, 100(2):305-332, (I discovered after publication that a very similar approach has been published some time ago, see "an iterative method for multi-class cost-sensitive learning" by Abe, Zadrozny and Langford, KDD'04), 2015. BibTex
B. SCHERRER, M. GHAVAMZADEH, V. GABILLON, B. LESNER, M. GEIST, "Approximate Modified Policy Iteration and its Application to the Game of Tetris". In Journal of Machine Learning Research, 16:1629-1676, 2015. BibTex

Conference Proceedings :

M. LAUFFER, F. GENTY, S. MARGUERON, J.L. COLLETTE, J.C. PIHAN, "Automatic recognition system of aquatic organisms by classical and fluorescence microscopy". In Proceedings of SPIE 9506, 9506(1O):1-7, Prague (République Tchèque), 2015. BibTex
S. ROSSIGNOL, "Technique d'assistance par le bruit pour aider l'oprateur de Teager-Kaiser suivre une composante frquentielle perturbe par du bruit". In GRETSI, 2015. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Imitation Learning Applied to Embodied Conversational Agents". In Machine Learning and Interactive Systems (MLIS), 2015. BibTex
M. GEIST, "A multiplicative UCB strategy for Gamma rewards". In European Workshop on Reinforcement Learning (EWRL), 2015. BibTex
T.. MUNZER, B. PIOT, M. GEIST, O. PIETQUIN, M.. LOPES, " Inverse Reinforcement Learning in Relational Domains". In International Joint Conferences on Artificial Intelligence (IJCAI), (to appear), 2015. BibTex

- 2014 -

Regular Papers :

H. FREZZA-BUET, "Online Computing of Non-Stationary Distributions Velocity Fields by an Accuracy Controlled Growing Neural Gas". In Neural Networks, 60:203--221, 2014. BibTex
M. GEIST, B. SCHERRER, "Off-policy Learning with Eligibility Traces: A Survey". In Journal of Machine Learning Research (JMLR), 15:289-333, 2014. BibTex
N. FRESSENGEAS, H. FREZZA-BUET, "Cellular Computing and Least Squares for partial differential problems parallel solving". In Journal of Cellular Auromata, 9(1):1-21, 2014. BibTex

Conference Proceedings :

M. LAUFFER, F. GENTY, S. MARGUERON, J.L. COLLETTE, A. LEVEY, Y. HOUZELLE, J.C. PIHAN, "Fluorescence imaging of plant pigments for automatical recognition of aquatic organisms". In Proceedings of the XII Conference on Optical Chemical Sensors and Biosensors:FS-29, Athens (Grèce), 2014. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Difference of Convex Functions Programming for Reinforcement Learning". In Advances in Neural Information Processing Systems (NIPS 2014), 2014. BibTex
B. PIOT, O. PIETQUIN, M. GEIST, "Predicting when to laugh with structured classification". In Annual Conference of the International Speech Communication Association (InterSpeech), 2014. BibTex
B. SCHERRER, M. GEIST, "Local Policy Search in a Convex Space and Conservative Policy Iteration as Boosted Policy Search". In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), (to appear), 2014. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Boosted Bellman Residual Minimization Handling Expert Demonstrations". In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), Springer, (to appear), 2014. BibTex
J. FIX, "Dynamic formation of self-organizing maps". In 10th workshop on Self-Organizing Maps, WSOM 2014, 2014. BibTex
D. BAHEUX, H. FREZZA-BUET, J. FIX, "Towards an effective multi-map self organizing recurrent neuronal network". In Proc. ESANN, 2014. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Boosted and Reward-regularized Classification for Apprenticeship Learning". In 13th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2014), Paris, France, (accepted, to appear), 2014. BibTex

Thesis :

B. KHOUZAM, "Neural Networks as Cellular Computing Models for Temporal Sequence Porcessing", PhD Thesis, Suplec, 2014. BibTex

Workshops :

B. PIOT, M. GEIST, O. PIETQUIN, "Mthode de minimisation du rsidu de Bellman booste qui tient compte des dmonstrations expertes". In Journes Francophone de Plannification, Dcision et Apprentissage (JFPDA), 2014. BibTex
B. SCHERRER, M. GEIST, "Quand l'optimalit locale implique une garantie globale : recherche locale de politique dans un espace convexe et algorithme d'itration sur les politiques conservatif vu comme une monte de gradient ". In Journes Francophone de Plannification, Dcision et Apprentissage (JFPDA), 2014. BibTex

- 2013 -

Regular Papers :

J. FIX, "Template based black-box optimization of dynamic neural fields". In Neural Networks, 46:40--49, 2013. BibTex
H. FREZZA-BUET, M. GEIST, "A C++ Template-Based Reinforcement Learning Library: Fitting the Code to the Mathematics". In Journal of Machine Learning Research, 14:625 - 628, 2013. BibTex
M. GEIST, O. PIETQUIN, "Algorithmic Survey of Parametric Value Function Approximation". In IEEE Transactions on Neural Networks and Learning Systems, 24(6):845 - 867, pdf, 2013. BibTex
E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "Classification structure pour l'apprentissage par renforcement inverse". In Revue d'Intelligence Artificielle, 27(2/2013):155-170, pdf, 2013. BibTex
B. KHOUZAM, H. FREZZA-BUET, "Distributed Recurrent Self-Organization for Tracking the State of Non-Stationary Partially Observable Dynamical Systems". In Biologically Inspired Cognitive Architectures, 3:87--104, 2013. BibTex
O. PIETQUIN, H. HASTIE, "A survey on metrics for the evaluation of user simulations". In Knowledge Engineering Review, 28(01):59-73, first published as FirstView, 2013. BibTex

Conference Proceedings :

L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Model-free POMDP optimisation of tutoring systems with echo-state networks". In Proceedings of the 14th SIGDial Meeting on Discourse and Dialogue (SIGDial 2013):102-106, Metz (France), 2013. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Learning from demonstrations: Is it worth estimating a reward function?". In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2013), Blockeel, Hendrik and Kersting, Kristian and Nijssen, Siegfried and Zelezny, Filip, ISBN: 978-3-642-40987-5, Springer, Lecture Notes in Computer Science, 8188:17-32, Prague (Czech Republic) , 2013. BibTex
E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "A cascaded supervised learning approach to inverse reinforcement learning". In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2013), Blockeel, Hendrik and Kersting, Kristian and Nijssen, Siegfried and Zelezny, Filip, ISBN: 978-3-642-40987-5, Springer, Lecture Notes in Computer Science, 8188:1-16, Prague (Czech Republic) , 2013. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Particle Swarm Optimisation of Spoken Dialogue System Strategies". In Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon (France), 2013. BibTex
L. EL ASRI, R. LAROCHE, O. PIETQUIN, "Reward Shaping For Statistical Optimisation Of Dialogue Management". In Proceedings of the International Conference on Statistical Language and Speech Processing (SLSP 2013), Dediu, Adrian-Horia and Martin-Vide, Carlos and Mitkov, Ruslan and Truthe, Bianca, ISBN: 978-3-642-39592-5, Springer, Lecture Notes in Computer Science , 7978:93-101, Tarragona (Spain), 2013. BibTex
O. PIETQUIN, "Inverse Reinforcement Learning for Interactive Systems". In Proceedings of the IJCAI workshop on Machine Learning for Interactive Systems (MLIS 2013), Beijing (China), Invited Speaker, 2013. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Random Projections: a Remedy for Overfitting Issues in Time Series Prediction with Echo State Networks". In Proceedings of the 38th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), Vancouver, Canada, to appear, 2013. BibTex
R. NIEWIADOMSKI, J. HOFMANN, J. URBAIN, T. PLATT, J. WAGNER, B. PIOT, H. CAKMAK, S. PAMMI, T. BAUR, S. DUPONT, M. GEIST, F. LINGENFELSER, G. MCKEOWN, O. PIETQUIN, W. RUCH, "Laugh-aware virtual agent and its impact on user amusement ". In Proceedings of the Twelfth International Conference on Autonomous Agents and Multiagent Systems (AAMAS2013):619-626, Saint Paul, USA, 2013. BibTex

Proceedings :

M. ESKENAZI, M. STRUBE, B. DI EUGENIO, J. WILLIAMS, O. PIETQUIN, "14th annual SIGdial Meeting on Discourse and Dialogue ", 2013. BibTex

Technical Reports :

M. GEIST, B. SCHERRER, "Off-policy Learning with Eligibility Traces: A Survey", Suplec - INRIA, 2013. BibTex

Patents :

J. OSTER, G. CLIFFORD, O. PIETQUIN, M. GEIST, "Periodic Artifact Reduction from Biomedical Signals", patent WO2013052944, 2013. BibTex

Workshops :

B. PIOT, M. GEIST, O. PIETQUIN, "Classification rgularise par la rcompense pour l'Apprentissage par Imitation". In Journes Francophones de Plannification, Dcision et Apprentissage (JFPDA), Lille (FRANCE), 2013. BibTex
M. GEIST, E. KLEIN, B. PIOT, Y. GUERMEUR, O. PIETQUIN, "Around Inverse Reinforcement Learning and Score-based Classification". In 1st Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Princeton (USA), 2013. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Learning from demonstrations: Is it worth estimating a reward function?". In 1st Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Princeton, USA, 2013. BibTex
L. DAUBIGNEY, M. GEIST, O. PIETQUIN, "Optimisation par essaims particulaires de stratgies de dialogue". In Journes Francophones de Plannification, Dcision et Apprentissage (JFPDA), 2013. BibTex
E. KLEIN, B. PIOT, M. GEIST, O. PIETQUIN, "Apprentissage par renforcement inverse en cascadant classification et rgression". In Journes Francophones de Plannification, Dcision et Apprentissage (JFPDA), 2013. BibTex
B. PIOT, M. GEIST, O. PIETQUIN, "Apprentissage par dmonstrations : vaut-il la peine d'estimer une fonction de rcompense?". In Journes Francophones de Plannification, Dcision et Apprentissage (JFPDA), 2013. BibTex