???global.info.a_carregar???
- I have worked in Natural Language Processing, a sub-area of Artificial Intelligence, since the year 2000, having participated in multiple national and European research projects and authored or co-authored around 70 peer-reviewed publications. - I am the Scientific Resources and Users Support Manager of the PORTULAN CLARIN Research Infrastructure for the Science and Technology of Language, part of the Portuguese National Network of Research Infrastructures of Strategic Interest and national node of the European Infrastructure CLARIN ERIC. - I am a founding member of NLX, the Natural Language and Speech Group of the Faculty of Sciences of the University of Lisbon, a R&D unit in Natural Language Processing, with a particular focus on the Portuguese language. - I have developed or co-developed multiple natural language processing tools for Portuguese with state-of-the-art performance. - I have developed or co-developed multiple high-quality data sets and annotated corpora for Portuguese. - I have supervised or co-supervised multiple MSc theses and I am currently co-supervising a PhD thesis. - I am a member of the Mind-Brain College of the University of Lisbon. - I was (2015-2017) an Invited Teacher at the Department of Informatics of the Faculty of Sciences of the University of Lisbon. - I was a member of the Management Committee of PROPOR, the International Conference on the Computational Processing of Portuguese. - I was a member of the Management Committee for Portugal of the enetCollect COST Action (CA16105).
Identification

Personal identification

Full name
João Ricardo Martins Ferreira da Silva

Citation names

  • Silva, João

Author identifiers

Ciência ID
5C1E-2B08-22DD
ORCID iD
0000-0002-6490-1807
Google Scholar ID
yBwktxgAAAAJ
Researcher Id
DVE-2747-2022

Knowledge fields

  • Exact Sciences - Computer and Information Sciences

Languages

Language Speaking Reading Writing Listening Peer-review
English Advanced (C1) Advanced (C1) Advanced (C1) Advanced (C1)
Education
Degree Classification
2014
Concluded
Doutoramento em Informática (Doutoramento)
Universidade de Lisboa Faculdade de Ciências, Portugal
"Robust Handling of Out-of-Vocabulary Words in Deep Language Processing" (THESIS/DISSERTATION)
Aprovado com Distinção e Louvor
2007
Concluded
Mestrado em Informática (Mestrado)
Universidade de Lisboa Faculdade de Ciências, Portugal
"Shallow Processing of Portuguese: From Sentence Chunking to Nominal Lemmatization" (THESIS/DISSERTATION)
Muito Bom
2004
Concluded
Licenciatura em Informática (Licenciatura)
Universidade de Lisboa Faculdade de Ciências, Portugal
15
Affiliation

Science

Category
Host institution
Employer
2024/10/01 - Current Researcher (Research) FCiênciasID Associação para a Investigação e Desenvolvimento de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2019 - 2024/09/30 Researcher (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2016/10/01 - 2019 Postdoc (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2015/11/15 - 2016/09/30 Postdoc (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2015/05/01 - 2015/11/14 Postdoc (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2014/09/01 - 2015/04/30 Postdoc (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2013/02/01 - 2014/08/31 Research Assistant (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2012/07/01 - 2013/01/31 Research Assistant (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2008/01/01 - 2008/06/30 Research Assistant (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2006/07/01 - 2007/12/31 Research Assistant (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2004/03/01 - 2006/07/31 Research Assistant (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal
2000/02/01 - 2002/02/28 Research Assistant (Research) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal

Teaching in Higher Education

Category
Host institution
Employer
2015/09/01 - 2017/09/11 Invited Assistant Professor (University Teacher) Universidade de Lisboa Faculdade de Ciências, Portugal
Universidade de Lisboa Faculdade de Ciências, Portugal

Others

Category
Host institution
Employer
2008/07/01 - 2012/06/30 PhD Grant from FCT (SFRH/BD/41465/2007) Universidade de Lisboa Faculdade de Ciências, Portugal
2003/06/01 - 2005/07/30 Google International Agent Google Inc, United States
Projects

Contract

Designation Funders
2024 - Current IMPROMPT - Image Alteration with Language Prompts
CPCA-IAC/AV/590897/2023
Principal investigator
Google Inc

Fundação para a Ciência e a Tecnologia
Ongoing
2019 - Current PORTULAN CLARIN - Research Infrastruture for the Science and Technology of Language
PINFRA/22117/2016
Researcher
Fundação para a Ciência e a Tecnologia
Ongoing
2023/01 - 2024/01 Language Driven Image Design with Diffusion
2022.15880.CPCA.A1
Principal investigator
Rede Nacional de Computação Avançada
Concluded
2022/12 - 2023/12 GPTPT - Transformer-based Decoder for the Portuguese Language
CPCA-IAC/AV/478395/2022
Principal investigator
Google Inc

Fundação para a Ciência e a Tecnologia
Concluded
2022/12 - 2023/12 ALBERTINA - Foundation Encoder Model for Portuguese and AI
CPCA-IAC/AV/478394/2022
Co-Principal Investigator (Co-PI)
Google Inc

Fundação para a Ciência e a Tecnologia
Concluded
2016 - 2019 ASSET - Intelligent Assistance for Everyone Everywhere
P2020/3279
Research Fellow
Fundação para a Ciência e a Tecnologia
Concluded
2013 - 2016 QTLeap - Quality Translation with Deep Language Engineering Approaches
FP7/610516
Research Fellow
European Commission
Concluded
2013/05/15 - 2015/11/14 DP4LT - Processamento Profundo para a Tecnologia da Linguagem
PTDC/EEI-SII/1940/2012
Research Fellow
Fundação da Faculdade de Ciências da Universidade de Lisboa, Portugal
Fundação para a Ciência e a Tecnologia
Concluded
2012 - 2013 METANET4U - Enhancing the European Linguistic Infrastructure
EC/ICTPSP/270893
Research Fellow
European Commission
Concluded
2008/02/19 - 2010/12/31 SemanticShare - Ferramentas e Recursos para o Processamento Semântico
PTDC/PLP/81157/2006
Research Fellow
Universidade de Lisboa Faculdade de Ciências, Portugal

Fundação da Faculdade de Ciências da Universidade de Lisboa, Portugal
Fundação para a Ciência e a Tecnologia
Concluded
2005/08/18 - 2007/12/31 QueXting - Responder a Perguntas com base na Web Portuguesa
POSC/PLP/61490/2004
Research Fellow
Universidade de Lisboa Faculdade de Ciências, Portugal

Fundação da Faculdade de Ciências da Universidade de Lisboa, Portugal
Fundação para a Ciência e a Tecnologia
Concluded
2004/03/01 - 2006/07/31 TAGSHARE - Ferramentas e Recursos para a Etiquetagem e Processamento Morfossintáctico Superficial
POSI/PLP/47058/2002
Research Fellow
Fundação para a Ciência e a Tecnologia
Concluded
2000/10/01 - 2003/06/30 NEXING - Modelação e Processamento da Negação Natural
POSI/PLP/34076/2000
Research Fellow
Fundação para a Ciência e a Tecnologia
Concluded
Outputs

Publications

Book chapter
  1. Branco, António; Sara Grilo; Silva, João. "Language Report Portuguese". In European Language Equality: A Strategic Agenda for Digital Language Equality, edited by Georg Rehm; Andy Way. Springer, 2023.
    Published
  2. Silva, João; Grilo, Sara; Bolrinha, Márcia; Santos, Rodrigo; Gomes, Luís; Branco, António; Vaz, Rui. "Where do I belong in six centuries of literature? Datasets and AI-based tools for Portuguese literary documentos made possible and available by PORTULAN CLARIN". In CLARIN: The infrastructure for language resources. De Gruyter, 2022.
    Published
  3. Gomes, Luís; Branco, Ruben; Silva, João; Branco, António. "Open and inclusive language processing: Language processing services by PORTULAN to meet the widest needs of CLARIN user". In CLARIN: The Infrastructure for Language Resources. De Gruyter, 2022.
    Published
  4. Branco, António; Silva, João. "Swift Development of State of the Art Taggers for Portuguese". In Language Technology for Portuguese: Shallow Processing Tools and Resources, 29-45. Edições Colibri, 2004.
    Published
Conference paper
  1. Santos, Rodrigo; Silva, João; Gomes, Luís; Rodrigues, João; Branco, António. "Advancing Generative AI for Portuguese with Open Decoder Gervásio PT*". Paper presented in Annual Meeting of the ELRA-ISCA Special Interest Group on Under-resourced Languages (SIGUL2024), 2024.
    In press • 10.48550/ARXIV.2402.18766
  2. Santos, Rodrigo; Rodrigues, João; Gomes, Luís; Silva, João; Branco, António; Lopes Cardoso, Henrique; Osório, Tomás; Leite, Bernardo. "Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina PT* Family". Paper presented in Annual Meeting of the ELRA-ISCA Special Interest Group on Under-resourced Languages (SIGUL2024), 2024.
    In press • 10.48550/ARXIV.2403.01897
  3. Santos, Rodrigo; Silva, João; Branco, António. "Leveraging LLMs for On-the-fly Instruction Guided Image Editing". Paper presented in Portuguese Conference on Artificial Intelligence (EPIA), 2024.
    Published
  4. Gomes, Luís; Branco, António; Silva, João; Rodrigues, João; Santos, Rodrigo. "Open Sentence Embeddings for Portuguese with the Serafim PT* encoders family". Paper presented in Portuguese Conference on Artificial Intelligence (EPIA), 2024.
    Published
  5. Rodrigues, João; Gomes, Luís; Silva, João; Branco, António; Santos, Rodrigo; Lopes Cardoso, Henrique; Osório, Tomás. "Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*". Paper presented in Portuguese Conference on Artificial Intelligence (EPIA), 2023.
    Published
  6. Branco, António; Silva, João; Gomes, Luís; Rodrigues, João. "Universal Grammatical Dependencies for Portuguese with CINTIL Data, LX Processing and CLARIN support". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2022.
    Published
  7. Santos, Rodrigo; Branco, António; Silva, João. "Language driven image editing via Transformers". Paper presented in IEEE International Conference on Technological Advancements and Innovation (ICTAI 2022), 2022.
    Published
  8. Santos, Rodrigo; Branco, António; Silva, João. "Cost-effective language driven image editing with LX-DRIM". Paper presented in First Workshop on Performance and Interpretability Evaluations of Multimodal, Multipurpose, Massive-Scale Models (MMMPIE, co-located with COLING 2022), 2022.
    Published
  9. Santos, Rodrigo; Silva, João; Branco, António. "More Data is Better Only to Some Level, After Which it is Harmful: Profiling Neural Machine Translation Self-Learning with Back-Translation". Paper presented in Conference on Artificial Intelligence (EPIA), 2021.
    Published
  10. Branco, Ruben; Branco, António; Rodrigues, João; Silva, João. "Commonsense Reasoning: How do Neuro-Only and Hybrid Neuro-Symbolic Approaches Compare?". Paper presented in International Conference on Information and Knowledge Management (CIKM): Workshop on Knowledge Injection in Neural Networks (KINN), 2021.
    Published
  11. Branco, Ruben; Branco, António; Rodrigues, João; Silva, João. "Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning". Paper presented in Empirical Methods in Natural Language Processing (EMNLP), 2021.
    Published
  12. Santos, Rodrigo; Silva, João; Branco, António. "Making the Most of Synthetic Parallel Texts: Portuguese-Chinese Neural Machine Translation Enhanced with Back-Translation". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2020.
    Published
  13. Branco, António; Mendes, Amália; Quaresma, Paulo; Gomes, Luís; Silva, João; Teixeira, Andrea. "Infrastructure for the Science and Technology of Language: PORTULAN CLARIN". Paper presented in International Workshop on Language Technology Platforms (IWLTP), 2020.
    Published
  14. Grilo, Sara; Bolrinha, Márcia; Silva, João; Vaz, Rui; Branco, António. "The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2020.
    Published
  15. Branco, António; Grilo, Sara; Bolrinha, Márcia; Saedi, Chakaveh; Branco, Ruben; Silva, João; Querido, Andreia; et al. "The MWN.PT WordNet for Portuguese: Projection, Validation, Cross-lingual Alignment and Distribution". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2020.
    Published
  16. Rodrigues, João; Branco, Ruben; Silva, João; Branco, António. "Reproduction and Revival of the Argument Reasoning Comprehension Task". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2020.
    Published
  17. Branco, António; Calzolari, Nicoletta; Vossen, Piek; van Noord, Gertjan; van Uytvanck, Dieter; Silva, João; Gomes, Luís; Moreira, André; Elbers, Williem. "A Shared Task of a New, Collaborative Type to Foster Reproducibility: A First Exercise in the Area of Language Science and Technology with REPROLANG2020". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2020.
    Published
  18. Santos, Rodrigo; Silva, João; Branco, António; Xiong, Deyi. "The Direct Path May Not Be the Best: Portuguese-Chinese Neural Machine Translation". Paper presented in Conference on Artificial Intelligence (EPIA), 2019.
    Published
  19. Branco, António; Branco, Ruben; Saedi, Chakaveh; Silva, João. "Browsing and Supporting Pluricentric Global Wordnet, or just your Wordnet of Interest". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2018.
    Published
  20. Rodrigues, João; Saedi, Chakaveh; Branco, António; Silva, João. "Semantic Equivalence Detection: Are Interrogatives Harder than Declaratives?". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2018.
    Published
  21. Rodrigues, João; Branco, Ruben; Silva, João; Saedi, Chakaveh; Branco, António. "Predicting Brain Activation with WordNet Embeddings". 2018.
    Published • 10.18653/v1/w18-2801
  22. Saedi, Chakaveh; Branco, António; Rodrigues, João; Silva, João. "WordNet Embeddings". Paper presented in Workshop on Representation Learning for NLP (RepL4NLP), 2018.
    Published • 10.18653/v1/w18-3016
  23. Gomes, Luís; Apolónia, Frederico; Branco, Ruben; Silva, João; Branco, António. "Setting up the PORTULAN / CLARIN repository". Paper presented in CLARIN Annual Conference, 2018.
    Published
  24. Saedi, Chakaveh; Rodrigues, João; Silva, João; Branco, António; Maraev, Vladislav. "Learning Profiles in Duplicate Question Detection". Paper presented in IEEE International Conference on Information Reuse and Integration (IRI), 2017.
    Published • 10.1109/iri.2017.39
  25. Rodrigues, João; Saedi, Chakaveh; Maraev, Vladislav; Silva, João; Branco, António. "Ways of Asking and Replying in Duplicate Question Detection". Paper presented in Joint Conference on Lexical and Computational Semantics (*SEM), 2017.
    Published • 10.18653/v1/s17-1030
  26. Maraev, Vladislav; Saedi, Chakaveh; Rodrigues, João; Branco, António; Silva, João. "Character-Level Convolutional Neural Networks for Paraphrase Detection and Other Experiments". Paper presented in Artificial Intelligence and Natural Language Conference (AINL), 2017.
    Published
  27. Otegi, Arantxa; Aranberri, Nora; Branco, António; Hajic, Jan; Neale, Steven; Osenova, Petya; Pereira, Rita; et al. "QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages". 2016.
  28. Carvalho, Rita de; Querido, Andreia; Campos, Marisa; Pereira, Rita Valadas; Silva, João; Branco, António. "CINTIL DependencyBank PREMIUM. A corpus of grammatical dependencies for Portuguese". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2016.
    Published
  29. Gaudio, Rosa; Labaka, Gorka; Agirre, Eneko; Osenova, Petya; Simov, Kiril; Popel, Martin; Oele, Dieke; et al. "SMT and Hybrid systems of the QTLeap project in the WMT16 IT-task". Paper presented in Conference on Machine Translation (WMT), 2016.
    Published • 10.18653/v1/w16-2332
  30. Artetxe, Mikel; Labaka, Gorka; Saedi, Chakaveh; Rodrigues, João; Silva, João; Branco, António; Agirre, Eneko. "Adding Syntactic Structure to Bilingual Terminology for Improved Domain Adaptation". Paper presented in Deep Machine Translation Workshop (DeepMT), 2016.
    Published
  31. Rodrigues, João; Branco, António; Neale, Steven; Silva, João. "LX-DSemVectors: Distributional Semantics Models for Portuguese". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2016.
    Published
  32. Neale, Steven; Pereira, Rita Valadas; Silva, João; Branco, António; Valadas, Rita; Silva, João; Branco, António. "Lexical Semantics Annotation for Enriched Portuguese Corpora". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2016.
    Published • 10.1007/978-3-319-41552-9 30
  33. Rodrigues, João; Gomes, Luís; Neale, Steven; Querido, Andreia; Rendeiro, Nuno; Štajner, Sanja; Silva, João; Branco, António. "Domain-Specific Hybrid Machine Translation from English to Portuguese". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2016.
    Published
  34. Silva, João; Rodrigues, João; Gomes, Luís; Branco, António. "Bootstrapping a hybrid deep MT system". Paper presented in Workshop on Hybrid Approaches to Translation (HyTra), 2015.
    Published • 10.18653/v1/w15-4101
  35. Neale, Steven; Silva, João; Branco, António. "An Accessible Interface Tool for Manual Word Sense Annotation". Paper presented in Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA), 2015.
    Published
  36. Branco, António; Carvalheiro, Catarina; Costa, Francisco; Castro, Sérgio; Silva, João; Martins, Cláudia; Ramos, Joana. "DeepBankPT and companion Portuguese Treebanks in a Multilingual Collection of Treebanks". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2014.
    Published
  37. Branco, António; Rodrigues, João; Costa, Francisco; Silva, João; Vaz, Rui. "Rolling out Text Categorization for Language Learning Assessment Supported by Language Technology". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2014.
    Published
  38. Branco, António; Rodrigues, João; Costa, Francisco; Silva, João; Vaz, Rui. "Assessing automatic text classification for interactive language learning". Paper presented in IEEE International Conference on Information Society (i-Society), 2014.
    Published • 10.1109/i-society.2014.7009014
  39. Rodrigues, João; Costa, Francisco; Silva, João; Branco, António. "Automatic Syllabification of Portuguese". Paper presented in Encontro Anual da Associação Portuguesa de Linguística (APL), 2014.
    Published
  40. Branco, António; Carvalheiro, Catarina; Pereira, Sílvia; Silveira, Sara; Silva, João; Castro, Sérgio; Graça, João. "A PropBank for Portuguese: the CINTIL-PropBank". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2012.
    Published
  41. Silva, João; Branco, António. "Deep, Consistent and Also Useful: Extracting Vistas from Deep Corpora for Shallower Task". Paper presented in Workshop on Advanced Treebanking at LREC, 2012.
    Published
  42. Silva, João; Branco, António. "Assigning Deep Lexical Types Using Structured Classifier Features for Grammatical Dependencies". Paper presented in Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages at ACL, 2012.
    Published
  43. Silva, João; Branco, António. "Assigning Deep Lexical Types". Paper presented in International Conference on Text, Speech and Dialogue (TSD), 2012.
    Published
  44. Silva, João; Branco, António; Castro, Sérgio; Reis, Ruben. "Out-of-the-Box Robust Parsing of Portuguese". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2010.
    Published
  45. Silva, João; Branco, António; Nunes, Patricia. "Top-performing Robust Parsing of Portuguese: Freely available in as many ways as you can get it". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2010.
    Published
  46. Branco, António; Costa, Francisco; Silva, João; Silveira, Sara; Castro, Sérgio; Avelãs, Mariana; Pinto, Clara; Graça, João. "Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2010.
    Published
  47. Branco, António; Costa, Francisco; Ferreira, Eduardo; Martins, Pedro; Nunes, Filipe; Silva, João; Silveira, Sara. "LX-Center: A Center of Online Linguistic Services". Paper presented in ACL-IJCNLP, 2009.
    Published
  48. Branco, António; Costa, Francisco; Ferreira, Eduardo; Martins, Pedro; Nunes, Filipe; Silva, João; Silveira, Sara. "LX-Center: A Center of Online Services for Education, Research and Development on Language Science and Technology". 2009.
    Published
  49. Branco, António; Costa, Francisco; Martins, Pedro; Nunes, Filipe; Silva, João; Silveira, Sara. "LXService: Web Services of Language Technology for Portuguese". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2008.
    Published
  50. Branco, António; Rodrigues, Lino; Silva, João; Silveira, Sara. "Real-Time Open-Domain QA on the Portuguese Web". Paper presented in Ibero-American Conference on AI (IBERAMIA), 2008.
    Published
  51. Branco, António; Silva, João; Rodrigues, Lino; Silveira, Sara. "XisQuê: An Online QA Service for Portuguese". Paper presented in International Conference on the Computational Processing of Portuguese (PROPOR), 2008.
    Published
  52. Branco, António; Silva, João. "Very High Accuracy Rule-Based Nominal Lemmatization with a Minimal Lexicon". Paper presented in Encontro Anual da Associação Portuguesa de Linguística (APL), 2007.
    Published
  53. Barreto, Florbela; Branco, António; Ferreira, Eduardo; Mendes, Amália; Bacelar do Nascimento, Maria Fernanda; Nunes, Filipe; Silva, João. "Open Resources and Tools for the Shallow Processing of Portuguese: the TagShare project". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2006.
    Published
  54. Barreto, Florbela; Branco, António; Ferreira, Eduardo; Mendes, Amália; Bacelar do Nascimento, Maria Fernanda; Nunes, Filipe; Silva, João. "Linguistic Resources and Software for Shallow Processing". Paper presented in Encontro Anual da Associação Portuguesa de Linguística de Lisboa, 2006.
    Published
  55. Branco, António; Silva, João. "A Suite of Shallow Processing Tools for Portuguese: LX-Suite". Paper presented in European Chapter of the Association for Computational Linguistics (EACL), 2006.
    Published • 10.3115/1608974.1609003
  56. Branco, António; Silva, João. "Evaluating Solutions for the Rapid Development of State-of-the-Art POS Taggers for Portuguese". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2004.
    Published
  57. Branco, António; Silva, João. "Morpho-syntactic Tagging without Training Corpus or Lexicon: How Far is it Possible to Get?". Paper presented in Encontro Anual da Associação Portuguesa de Linguística (APL), 2003.
    Published
  58. Branco, António; Silva, João. "A Metric for the Efficency of Accurate Tagging Procedures". Paper presented in International Conference on Recent Advances in Natural Language Processing (RANLP), 2003.
    Published
  59. Branco, António; Silva, João. "Portuguese-Specific Issues in the Rapid Development of State of the Art Taggers". Paper presented in Tagging and Shallow Processing of Portuguese (TASHA), 2003.
    Published
  60. Branco, António; Leitão, José; Silva, João. "Nexing Corpus: A Corpus of Verbal Protocols on Syllogistic Reasoning". Paper presented in International Conference on Language Resources and Evaluation (LREC), 2002.
    Published
  61. Branco, António; Silva, João. "EtiFac: A Facilitating Tool for Manual Tagging". Paper presented in Encontro Anual da Associação Portuguesa de Linguística (APL), 2001.
    Published
Journal article
  1. Gomes, Luís; Branco, António; Silva, João; Branco, Ruben. "From greatest simplicity to full power". Language Resources and Evaluation (2024): http://dx.doi.org/10.1007/s10579-024-09772-6.
    Published • 10.1007/s10579-024-09772-6
  2. Querido, Andreia; Carvalho, Rita; Rodrigues, João; Garcia, Marcos; Silva, João; Correia, Catarina; Rendeiro, Nuno; et al. "LX-LR4DistSemEval: A Collection of Language Resources for the Evaluation of Distributional Semantic Models of Portuguese". Revista da Associação Portuguesa de Linguística 3 (2017):
    Published
  3. Querido, Andreia; Carvalho, Rita de; Rodrigues, João; Silva, João; Neale, Steven; Pereira, Rita; Gomes, Patrícia; et al. "Named Entities in the QTLeap Corpus of Online Helpdesk Interactions". Revista da Associação Portuguesa de Linguística 2 (2016): 459-474. http://hdl.handle.net/10451/33105.
    Published • 10.21747/2183-9077/rapl2a20
  4. Branco, António; Silva, João. "Dedicated Nominal Featurization of Portuguese". Lecture Notes in Artificial Intelligence (LNAI) 3960 (2006):
    Published
  5. Branco, António; Silva, João. "Accurate Annotation: an Efficiency Metric". Recent Advances in Natural Language Processing (RANLP) III (2005): 173-182.
    Published
  6. Branco, António; Silva, João. "Contractions: Breaking the Tokenization-Tagging Circularity". Lecture Notes in Artificial Intelligence (LNAI) 2721 (2003): 167-170.
    Published
Report
  1. Branco, António; Silva, João; Querido, Andreia; Carvalho, Rita. 2015. CINTIL DependencyBank PREMIUM Handbook: Design options for the representation of grammatical dependencies. http://hdl.handle.net/10451/20226.
  2. Branco, António; Silva, João; Costa, Francisco; Castro, Sérgio. 2011. CINTIL TreeBank Handbook: Design Options for the Representation of Syntactic Constituency.
  3. Branco, António; Castro, Sérgio; Silva, João; Costa, Francisco. 2011. CINTIL DepBank Handbook: Design Options for the Representation of Grammatical Dependencies.
  4. Branco, António; Silva, João. 2003. Tokenization of Portuguese: resolving the hard cases. http://hdl.handle.net/10451/14199.
Thesis / Dissertation
  1. Silva, João Ricardo Martins Ferreira da, 1977-. "Robust handling of out-of-vocabulary words in deep language processing". PhD, Universidade de Lisboa Faculdade de Ciências, 2014. http://hdl.handle.net/10451/11956.
  2. Silva, João; Silva, João Ricardo Martins Ferreira da. "Shallow Processing of Portuguese: From Sentence Chunking to Nominal Lemmatization". Master, Universidade de Lisboa Faculdade de Ciências, 2007. http://hdl.handle.net/10451/14016.
Activities

Oral presentation

Presentation title Event name
Host (Event location)
2017/05 Robust Handling of Out-of-Vocabulary Words in Deep Language Processing Workshop "Verbos e Preposições"
Faculdade de Ciências da Universidade do Porto (Porto, Portugal)

Supervision

Thesis Title
Role
Degree Subject (Type)
Institution / Organization
2021 - 2023 Unsupervised Neural Machine Translation between the Portuguese language and the Chinese and Korean languages
Supervisor
Universidade de Lisboa Faculdade de Ciências, Portugal
2021 - 2022/10/31 Anonimização Automática de Dados Estruturados
Supervisor
Universidade de Lisboa Faculdade de Ciências, Portugal
2020 - 2022/03 Recognizing Emotions in Short Texts
Supervisor
Ciência Cognitiva (Master)
Universidade de Lisboa Faculdade de Ciências, Portugal
2019 - 2019 Portuguese-Chinese neural machine translation
Co-supervisor
Engenharia Informática (Master)
Universidade de Lisboa Faculdade de Ciências, Portugal

Event organisation

Event name
Type of event (Role)
Institution / Organization
2018/03 - 2019/03 3rd Annual Meeting of enetCollect (2019/03/14 - 2019/03/16)
Conference (Member of the Organising Committee)
Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa, Portugal
2015/04 - 2016/11 12th International Conference on the Computational Processing of Portuguese (PROPOR) (2016 - 2016)
Conference (Other)
2014 - 2014 10th Delph-In Summit (2014/07/14 - 2014/07/18)
Meeting (Member of the Organising Committee)
Universidade de Lisboa Faculdade de Ciências, Portugal

Event participation

Activity description
Type of event
Event name
Institution / Organization
2022/11/07 - 2022/11/07 Round table on Artificial Intelligence and Natural Language Processing
Round table
Conversa com Investigadores
Fundacao de Computacao Cientifica Nacional, Portugal

Jury of academic degree

Topic
Role
Candidate name (Type of degree)
Institution / Organization
2023 Internacionalização semi-automática de software
(Thesis) Main arguer
Duarte Filipe Rodrigues Jardim Olival (Master)
Universidade de Lisboa Faculdade de Ciências, Portugal
2016 Atribuição de autoria em linguística forense: uma análise combinada para identificação de autor através do texto
(Thesis) Arguer
Liliana Rita de Amorim Romão Teles (Master)
Universidade de Lisboa Faculdade de Letras, Portugal

Association member

Society Organization name Role
2018/09 - Current Colégio Mente-Cérebro (Mind-Brain College)

Committee member

Activity description
Role
Institution / Organization
2018/10 - Current Representative of Portugal on the User Involvement Committee of CLARIN
Member
2018 - 2022 Comité gestor do PROPOR (International Conference on the Computational Processing of Portuguese)
Member
2017 - 2021 Management Committee de Portugal para a Ação COST enetCollect (CA16105)
Member

Course / Discipline taught

Academic session Degree Subject (Type) Institution / Organization
2019/05 - 2019/05 Workshop "eTradução" na Agência para a Modernização Adminstrativa (AMA)
2016/03 - 2016/03 Workshop "Plataforma de Tradução CEF.AT" na Representação da Comissão Europeia em Portugal
2016/01 - 2016/01 Workshop CLULunch "Introdução ao LaTeX" Universidade de Lisboa Centro de Linguística, Portugal