???global.info.a_carregar???
Mohamed Khemakhem is the core developer of GROBID-Dictionaries, a machine-learning library for structuring digitised lexical resources. He is currently a research associate at Inria, Centre de Recherche de Paris (ALMAnaCH Lab), and is completing his Doctoral Studies in Computer Science at Paris Diderot University with a thesis on "Standard-based Lexical Models for Automatically Structured Dictionaries". He is a member of several standardising committies on language resources (ISO, DIN, AFNOR), and co-project leader of ISO 24613-4 “TEI-Serialization”. His research interests include digitisation of lexical resources, machine learning based information extraction and digital humanities.
Identificação

Identificação pessoal

Nome completo
Mohamed Khemakhem

Nomes de citação

  • Khemakhem, Mohamed

Identificadores de autor

Ciência ID
F212-7C38-1418
ORCID iD
0000-0003-3529-2990
Google Scholar ID
7uv0BkAAAAJ&hl

Telefones

Telefone
  • (+33) 0768676304 (Profissional)

Domínios de atuação

  • Ciências Exatas - Ciências da Computação e da Informação - Ciências da Computação

Idiomas

Idioma Conversação Leitura Escrita Compreensão Peer-review
Árabe (Idioma materno)
Francês (Idioma materno)
Inglês Utilizador proficiente (C2) Utilizador proficiente (C2) Utilizador proficiente (C2) Utilizador proficiente (C2) Utilizador proficiente (C2)
Alemão Utilizador independente (B2) Utilizador independente (B2) Utilizador independente (B2) Utilizador proficiente (C1) Utilizador independente (B1)
Formação
Grau Classificação
2016/09 - 2020/08
Em curso
Doctoral Studies in Computer Science (Doctor)
Université Paris Diderot, França
"Standard-based Lexical Models for Automatically Structured Dictionaries" (TESE/DISSERTAÇÃO)
2010/09 - 2012/12
Concluído
Master in Information Systems and New Technologies (Master)
Université de Sfax Faculté des Sciences Economiques et de Gestion de Sfax, Tunísia
"Collaborative Construct and Query System for a Standardized Arabic Dictionary" (TESE/DISSERTAÇÃO)
2007/09 - 2010/06
Concluído
Bachelor in Computer Science Applied to Management (Bachelor (1.º ciclo de estudos))
Université de Sfax Faculté des Sciences Economiques et de Gestion de Sfax, Tunísia

Habib Maazoon High School, Tunísia
"Views Creation for the Interactive Standardized Arabic Dictionary" (TESE/DISSERTAÇÃO)
Percurso profissional

Ciência

Categoria Profissional
Instituição de acolhimento
Empregador
2016/06 - Atual Investigador (Investigação) Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
Centre Marc Bloch, Alemanha
2019/09/01 - 2020/08/31 Pós-doutorado (Investigação) Université Grenoble Alpes, França
2019/09 - 2020/08 Investigador (Investigação) Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
Université Grenoble Alpes, França
2014/09 - 2016/03 Investigador (Investigação) Ubiquitous Knowledge Processing (UKP) Lab, Alemanha
2012/02 - 2012/12 Investigador (Investigação) MIRACL Lab, Tunísia
Projetos

Bolsa

Designação Financiadores
2019/09 - 2020/08 BasNum
ANR-18-CE38-0003
Investigador
Université Grenoble Alpes, França

Inria Centre de Recherche de Paris, ALMAnaCH Lab, França

Projeto

Designação Financiadores
2016 - 2018 PARTHENOS
Bolseiro de Doutoramento
Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
Concluído

Outro

Designação Financiadores
2018 - Atual Paris Time Machine Consortium
Investigador
Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
Em curso
2018/04 - 2020/11 DISCO
Bolseiro de Investigação
Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
Em curso
2017 - 2018 Nénufar
Bolseiro de Investigação
Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
Concluído
Produções

Publicações

Artigo em conferência
  1. Khemakhem, Mohamed. "Information Extraction Workflow for Digitised Entry-based Documents". 2020.
  2. Khemakhem, Mohamed. "Selling autograph manuscripts in 19th c. Paris: digitising the Revue des Autographes". 2020.
  3. Khemakhem, Mohamed. "Nénufar: Modelling a Diachronic Collection of Dictionary Editions as a Computational Lexical Resource". 2019.
  4. Khemakhem, Mohamed. "TEI Encoding of a Classical Mixtec Dictionary Using GROBID- Dictionaries". 2019.
  5. Khemakhem, Mohamed. "Scaling up Automatic Structuring of Manuscript Sales Catalogues". 2019.
  6. Khemakhem, Mohamed. "How OCR Performance can Impact on the Automatic Extraction of Dictionary Content Structures". 2019.
  7. Khemakhem, Mohamed. "Historical Dictionaries as Digital Editions and Connected Graphs: the Example of Le Petit Larousse Illustré". 2019.
  8. Khemakhem, Mohamed. "LMF Reloaded". 2019.
  9. Khemakhem, Mohamed. "Retro-digitizing and Automatically Structuring a Large Bibliography Collection". 2018.
  10. Khemakhem, Mohamed. "Automatically Encoding Encyclopedic-like Resources in TEI". 2018.
  11. Khemakhem, Mohamed. "Fueling Time Machine: Information Extraction from Retro-Digitised Address Directories". 2018.
  12. Khemakhem, Mohamed. "Presenting the Nénufar Project: a Diachronic Digital Edition of the Petit Larousse Illustré". 2018.
  13. Khemakhem, Mohamed. "Enhancing Usability for Automatically Structuring Digitised Dictionaries". 2018.
  14. Khemakhem, Mohamed. "Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields". 2017.
  15. Khemakhem, Mohamed. "Sense-annotating a Lexical Substitution Data Set with Ubyline". 2016.

Outros

Outra produção
  1. GROBID and catalogues. 2018. Khemakhem, Mohamed. https://hal.archives-ouvertes.fr/cel-01951107.
  2. A Diachronic Digital Edition of the Petit Larousse illustré. 2018. Khemakhem, Mohamed. https://hal.archives-ouvertes.fr/hal-01873805.
Atividades

Organização de evento

Nome do evento
Tipo de evento (Tipo de participação)
Instituição / Organização
2018 - 2018 GROBID-Camp Spring 2018 – Inria, Paris (2018 - 2018)
Oficina (workshop)
2017 - 2017 GROBID-Camp Summer 2017 - ResearchGate, Berlin (2017 - 2017)
Oficina (workshop)

Arbitragem científica em conferência

Nome da conferência Local da conferência
2016 - 2016 10th edition of the Language Resources and Evaluation Conference (LREC), 23-28 May 2016, Portorož (Slovenia)

Consultoria / Parecer

Descrição da atividade Instituição / Organização
2019 - Atual Co-Project Leader of ISO 24613-4 “TEI-Serialization” (https://www.iso.org/standard/75411.html) International Organization for Standardization, Suiça
2016/06 - Atual - Study approaches and techniques for structuring modern and legacy digitized dictionaries; - Design and Implementation of an open source machine learning system for parsing and structuring digitized dictionaries; - Design the architecture of GROBID-Dictionaries (https://github.com/MedKhem/grobid-dictionaries) to cover more entry-based documents. Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
2019/09 - 2020/08 Customise GROBID-Dictionaries (https://github.com/MedKhem/grobid-dictionaries) to support the structuring of legacy dictionaries (Basnage de Beauval dictionary). Inria Centre de Recherche de Paris, ALMAnaCH Lab, França
2014/09 - 2016/03 - Study the manual and semi-automatic techniques for linking corpora and lexical resources for the purpose of semantic annotation; - Get familiarized with DKPro, a repository of NLP tools based on Apache UIMA framework; - Design and implementation of UbyLine for the extraction and annotation of usage examples from corpora to be linked to entries in a lexical resource (UBY). Ubiquitous Knowledge Processing (UKP) Lab, Alemanha
2012/02 - 2012/12 - Adaptation of collaborative techniques for lexicographic tasks; - Design and implementation of a web based system for interactive query and collaborative enrichment of an Arabic dictionary, an instantiation of the ISO standard Lexicon Markup Framework (LMF). MIRACL Lab, Tunísia

Curso / Disciplina lecionado

Disciplina Curso (Tipo) Instituição / Organização
2018/12/03 - 2018/12/07 GROBID-Dictionaries workshop at Lexical data Masterclass 2018 – BBAW, Berlin Berlin Brandenburg Academy of Sciences (BBAW), Alemanha
2018/11/02 - 2018/11/02 GROBID-Dictionaries workshop at Stellenbosch University 2018 – SADiLaR, Stellenbosch Stellenbosch Institute for Advanced Studies (STIAS), África do Sul
2018/10/30 - 2018/10/30 GROBID-Dictionaries workshop at North-West University 2018 – SADiLaR, Potchefstroom South African Centre for Digital Language Resources (SADiLaR), África do Sul
2018/10/26 - 2018/10/26 GROBID-Dictionaries workshop at University of Pretoria 2018 – SADiLaR, Pretoria University of Pretoria, África do Sul
2018/06/26 - 2018/06/29 GROBID-Dictionaries workshop at CAHIER 2018 – Praxiling, Montpellier Université Paul-Valéry Montpellier 3, França
2017/12/04 - 2017/12/08 GROBID-Dictionaries workshop at Lexical Data Masterclass 2017 – BBAW, Berlin Berlin Brandenburg Academy of Sciences (BBAW), Alemanha

Membro de associação

Nome da associação Tipo de participação
2018 - Atual Member of PARTHENOS Project (https://www.parthenos-project.eu/) Member

Membro de comissão

Descrição da atividade
Tipo de participação
Instituição / Organização
2019 - Atual Member of DIN NA 105-00-66 AA committee “Language Resource Management”
Consultor
Deutsches Institut für Normung eV, Alemanha
2019 - Atual Member of AFNOR/X03A committee “Terminologie - principes et coordination”
Consultor
Association Française de Normalisation, França
2019 - Atual Member of the ISO/TC 37/SC4/WG4 “Language resource management”
Consultor
International Organization for Standardization, Suiça
2019 - Atual DARIAH BiblioData Working Group (https://www.dariah.eu/activities/working-groups/bibliographical-data-bibliodata/)
Membro
2018 - Atual Member of DARIAH-ERIC Working Group "Lexical Resources" (https://dariah-eric.github.io/lexicalresources/)
Membro
2018 - Atual Member of Groupe annuaires et adresses at Paris Time Machine (https://paris-timemachine.huma-num.fr/groupe-adresses-et-annuaires/)
Membro