The 17th International Conference on Computational Processing of Portuguese will be held in Salvador, BA, from April 13 to 16, 2026.

PROPOR 2026

The 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) will be held in Salvador - Brazil, from the 13th to the 16th of April 2026.

PROPOR is the main scientific meeting in the area of language and speech technologies for the Portuguese/Galician language. The event is supported by the PROPOR steering committee.

PROPOR is a biennial event hosted in Brazil and in Portugal (and now in Galicia). Past meetings were held in Lisbon, PT (1993); Curitiba, BR (1996); Porto Alegre, BR (1998); Évora, PT (1999); Atibaia, BR (2000); Faro, PT (2003); Itatiaia, BR (2006); Aveiro, PT (2008); Porto Alegre, BR (2010); Coimbra, PT (2012); São Carlos, BR (2014); Tomar, PT (2016); Canela, BR (2018); Évora, PT (2020); Fortaleza, BR (2022); and Santiago de Compostela, GZ (2024). More details about past events, the PROPOR steering committee, and the constitution can be found at propor.org.

Call for Papers

PROPOR 2026: 17th International Conference on Computational Processing of Portuguese.
Salvador - Bahia, April 13th to 16th, 2026

The International Conference on Computational Processing of Portuguese (PROPOR) is the main event in the area of human language processing focused on theoretical and technological issues of written and spoken Portuguese and Galician. The meeting has been a very rich forum for the exchange of ideas and partnerships among the research and industry communities dedicated to automated language processing, promoting the development of methodologies, resources, and projects that can be shared among researchers and practitioners in the field.

We call for papers describing work on any topic related to computational language and speech processing of Portuguese and Galician by researchers in industry or academia. Topics of interest include, but are not limited to:

  • Natural language processing tasks (e.g. parsing, word sense disambiguation, coreference resolution)
  • Natural language processing applications (e.g. question answering, subtitling, summarization, sentiment analysis)
  • Natural language generation
  • Information extraction and information retrieval
  • Speech technologies (e.g. spoken language generation, speech and speaker recognition, spoken language understanding)
  • Speech applications (e.g. spoken language interfaces, dialogue systems, speech-to-speech translation)
  • Resources, standardization and evaluation (e.g. corpora, ontologies, lexicons, grammars)
  • NLP-oriented linguistic description or theoretical analysis
  • Distributional semantics and language modeling
  • Portuguese language varieties and dialect processing (including the language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia, Guinea-Bissau, Macau, Mozambique, Portugal, and Sao Tome and Principe)
  • Multilingual studies, methods, applications and resources including Portuguese/Galician

PROPOR 2026 will be held from April 13th to April 16th in Salvador - BA, Brazil, a place of contact between the Portuguese language and both the indigenous languages of Brazil and the African languages brought to Brazil by enslaved people from Africa, a contact that deeply influenced Brazilian Portuguese and Brazilian culture.

PROPOR 2026 will be the 17th edition of the biennial PROPOR conference, hosted alternately in Brazil and Portugal. Past meetings were held in Lisbon, PT (1993); Curitiba, BR (1996); Porto Alegre, BR (1998); Évora, PT (1999); Atibaia, BR (2000); Faro, PT (2003); Itatiaia, BR (2006); Aveiro, PT (2008); Porto Alegre, BR (2010); Coimbra, PT (2012); São Carlos, BR (2014); Tomar, PT (2016); Canela, BR (2018); Évora, PT (2020); Fortaleza, BR (2022); and Santiago de Compostela, GZ (2024).

Mandatory Reviewing Workload

As the pace of research in the field continues to increase, we need to strengthen the commitment to reviewing that accompanies each paper submission. During the submission process, authors will be required to specify which co-authors commit to serving as reviewers for the event.

Ethics Policy

Authors are advised to follow the ACL Ethics Policy when preparing their submissions.

Authors are also strongly advised to follow the ACL guidelines on generative AI assistance in authorship.

Important Dates (Update!)

  • Full and short paper submission deadline: Jan 9th, 2026 (23:59 GMT-12); previously Dec 7th, 2025 (23:59 GMT-12)
  • Notification of paper acceptance or rejection: Feb 11th, 2026
  • Camera-ready papers due: Mar 15th, 2026
  • Conference: April 13th - 16th, 2026
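
Because the submission deadline is given in GMT-12, authors may want to check what it means in their own time zone. The following is a minimal sketch in Python, assuming the deadline of Jan 9th, 2026 at 23:59 GMT-12 listed above; the fixed UTC-03:00 offset used for Salvador and the variable names are illustrative assumptions, not part of the official call.

```python
from datetime import datetime, timedelta, timezone

# "GMT-12" is a fixed offset of twelve hours behind UTC.
GMT_MINUS_12 = timezone(timedelta(hours=-12), name="GMT-12")

# Assumption: Salvador, BA is modeled as a fixed UTC-03:00 offset.
SALVADOR = timezone(timedelta(hours=-3), name="UTC-03")

# Paper submission deadline as stated in the Important Dates list.
deadline = datetime(2026, 1, 9, 23, 59, tzinfo=GMT_MINUS_12)

# Convert the same instant to UTC and to Salvador local time.
deadline_utc = deadline.astimezone(timezone.utc)
deadline_salvador = deadline.astimezone(SALVADOR)

print("UTC:     ", deadline_utc.isoformat())
print("Salvador:", deadline_salvador.isoformat())
```

Replacing `SALVADOR` with another fixed offset gives the equivalent local time elsewhere; in every case the deadline falls on the following calendar day for time zones ahead of GMT-12.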

Submissions

Submissions should describe original, unpublished work. Authors are invited to submit two kinds of papers:

  • Full papers – Reporting substantial and completed work, especially those that may contribute in a significant way to the advancement of the area. Wherever appropriate, concrete evaluation results should be included. Full papers can have up to 8 content pages + 2 pages for references.
  • Short papers – Reporting small, focused contributions such as ongoing work, position papers, potential ideas to be discussed, negative results, or an interesting application nugget. Short papers can have up to 4 content pages + 1 page for references.

Each submission will be evaluated by at least two reviewers. As reviewing will be double-blind, submitted papers must be anonymized. That is, they should not contain the authors' names and affiliations. Authors must avoid self-references that reveal identity, like "We previously showed (Freitas, 1991) ...". Instead, they should prefer citations such as "Freitas (1991) previously showed ...". Separate author identification information will be required as part of the submission process. At submission time, only PDF format is accepted. For the final versions, authors of accepted papers will be given 1 extra content page to incorporate the reviewers' suggestions. Authors of accepted papers will be requested to send the source files for the production of the proceedings.

While recent editions have only accepted submissions in English, this year we are pleased to also accept papers written in Portuguese, reaffirming our commitment to promoting scientific exchange in our language.

All submitted papers must conform to the ACL style guidelines and use the LaTeX stylesheets below:

Camera-ready Submission Instructions

The camera-ready version of the papers, as well as the copyright transfer form, should be submitted through the CMT platform. Authors should update their papers considering the reviewers' suggestions, using the updated Camera-Ready Stylesheet provided below.

Multiple-submission policy

For submissions that have been or will be submitted to other meetings or publications, this information must be provided at submission time. If a submission is accepted, authors must notify the program chairs, indicating which meeting they choose for presentation of their work. Papers that will be (or have been) published elsewhere cannot be accepted for publication or presentation.

Papers can be submitted through the CMT submission system:

PROPOR 2026 Program Chairs

  • Diana Santos (Linguateca/Universitetet i Oslo)
  • Larissa Freitas (Universidade Federal de Pelotas)

Scientific Committee

  • Adina Vladu (ILG/USC)
  • Alberto Simões (Checkmarx/2AI)
  • Aline Paes (UFF)
  • Aline Macohin (PUCPR)
  • Alipio Jorge (UP)
  • Amália Mendes (U Lisboa)
  • Ana Isabel Mata (U Lisboa)
  • Anabela Barreiro
  • António Teixeira (UA)
  • António Branco (U Lisboa)
  • Arnaldo Candido Junior (UNESP)
  • Brenda Santana (UFPEL)
  • Brett Drury (INESC-TEC)
  • Bruno Martins (UP)
  • Carlos H. Ribeiro (ITA)
  • Catarina Oliveira (UA)
  • Christopher Shulby (ELSA/UFG)
  • Cláudia Barros (IFSP)
  • Claudia Freitas
  • Cristina Mota (Linguateca)
  • Daniel Alves (UNL)
  • Daniela Claro (UFBA)
  • David Martins de Matos (INESC-ID)
  • Dennis Balreira (UFRGS)
  • Diana Santos (Linguateca/UiO)
  • Eric Laporte (U Paris Est)
  • Evandro Fonseca (Blip)
  • Evelin Amorim (UP/INESC TEC)
  • Felipe Meneguzzi (U Aberdeen)
  • Fernando Batista (INESC-ID)
  • Helena Caseli (UFSCAR)
  • Helena Cameron (IP Portalegre/CIDEHUS)
  • Helena Vaz (UFMG)
  • Hidelberg Albuquerque (UFRPE)
  • Hugo Gonçalo Oliveira (UC)
  • Igor Caetano (USP)
  • Iria de Dios Flores (UPF)
  • Ivandré Paraboni (USP)
  • Jackson Souza (UFBA)
  • João Silva (U Lisboa)
  • Joel Carbonera (UFRGS)
  • Jorge Baptista (INESC-ID)
  • Larissa Freitas (UFPel)
  • Leandro Wives (UFRGS)
  • Leonardo Zilio (UCLouvain)
  • Lucelene Lopes (USP)
  • Marcelo Finger (USP)
  • Marcos Garcia (USC)
  • Maria Das Graças Volpe Nunes (USP)
  • Marlo Souza (UFBA)
  • Micaela Aguiar (U Minho)
  • Nádia Silva (UFG)
  • Oto Vale (UFSCAR)
  • Pablo Gamallo (USC)
  • Pablo Faria (UNICAMP)
  • Paula Cardoso (UFPA)
  • Plinio Barbosa (UNICAMP)
  • Prakash Poudyal (KU)
  • Priscila Osório
  • Purificação Silvano (UP)
  • Rafael Anchieta (IFMA)
  • Raquel Amaro (UNL)
  • Renata Vieira (U Evora)
  • Ricardo Rodrigues (UC)
  • Ricardo Ribeiro (INESC-ID)
  • Rozane Rebechi (UFRGS)
  • Roney Santos (UFBA)
  • Rui Sousa-Silva (UP)
  • Sandra Aluisio (USP)
  • Sara Mendes (UL)
  • Saullo Haniell (PUC Campinas)
  • Sebastian Pado (U Stuttgart)
  • Sheila Castilho (ADAPT)
  • Stella Tagnin (USP)
  • Suemi Higuchi (FGV)
  • Susana Duarte Martins (UNL)
  • Susana Sotelo Docío (USC)
  • Thiago Pardo (USP)
  • Valeria de Paiva (Topos)
  • Valeria Feltrim (UEM)
  • Viviane Moreira (UFRGS)
  • Vladia Pinheiro (UNIFOR)

PROPOR 2026: CALL FOR BEST PhD/MSc DISSERTATION AWARD

The PROPOR 2026 Best PhD / MSc Dissertation Award recognizes outstanding dissertations in academic research and development topics relevant to the computational processing of Portuguese and Galician. This award intends to recognize excellent young researchers in their early careers and highlight theoretical and technological issues of written and spoken Portuguese and Galician. The award is managed by the Best PhD / MSc dissertation committee.

Award winners will be invited to publish their thesis/dissertation extended abstracts in the PROPOR 2026 proceedings. They will receive a free registration to the main conference as part of the award. The two award winners and runners-up will be invited to prepare a presentation of their work for the main conference, using a particular format (A0 poster for poster presentation or slides for oral presentation).

The Award Ceremony will take place during the PROPOR 2026 Conference.

Submission Criteria and Procedure:

Eligible submissions are those from candidates who have successfully defended their Master's or PhD theses/dissertations within the three years preceding the contest submission deadline, except for theses submitted to the previous contest held during PROPOR 2024. A letter from the primary dissertation advisor must be submitted with the extended abstract, stating that the candidate meets this eligibility criterion.

The dissertation must focus on some aspect of the written or spoken processing of any variety of Portuguese (including the language varieties of Portugal, Brazil, Cape Verde, Guinea-Bissau, Mozambique, Angola, São Tomé, Macau, Timor) or Galician.

Topics of interest include, but are not limited to:

  • Natural language processing tasks (e.g. parsing, word sense disambiguation, coreference resolution)
  • Natural language processing applications (e.g. question answering, subtitling, summarization, sentiment analysis)
  • Natural language generation
  • Information extraction and information retrieval
  • Speech technologies (e.g. spoken language generation, speech and speaker recognition, spoken language understanding)
  • Speech applications (e.g. spoken language interfaces, dialogue systems, speech-to-speech translation)
  • Resources, standardization and evaluation (e.g. corpora, ontologies, lexicons, grammars)
  • NLP-oriented linguistic description or theoretical analysis
  • Distributional semantics and language modeling
  • Portuguese language varieties and dialect processing (including the language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia, Guinea-Bissau, Macau, Mozambique, Portugal, and Sao Tome and Principe)
  • Multilingual studies, methods, applications and resources including Portuguese/Galician

Important Dates

  • Full and short paper submission deadline: Jan 9th, 2026 (23:59 GMT-12); previously Dec 12th, 2025 (23:59 GMT-12)
  • Notification of paper acceptance or rejection: Feb 28th, 2026
  • Camera-ready papers due: Mar 15th, 2026
  • Conference: April 13th - 16th, 2026

Submissions

Each submission, consisting of two PDF files, must comply with the following:

  • An extended abstract
  • Letter from the primary dissertation/thesis supervisor.

Extended Abstract

The document must be in the form of an extended abstract that includes the nature of the problem researched, relevant theory, hypotheses tested, method, analysis, results, and impacts (social, economic, technological, scientific, environmental).

Publications, formal reports written by the author, and other academic or non-academic results from the PhD or MSc are particularly relevant and should be included in the extended abstract.

Extended abstracts must include URLs for the PDF of the complete MSc / PhD Thesis/Dissertation. We suggest placing them at the end, before the references, and using tinyurl.com for long URLs.

Extended abstracts are limited to 6 pages, including all figures, tables, and references, and should begin with an abstract of 250 words or fewer. The extended abstract must be submitted in PDF format, following the same style as PROPOR 2026.

While recent editions have only accepted submissions in English, this year we are pleased to also accept papers written in Portuguese, reaffirming our commitment to promoting scientific exchange in our language.

Submissions must be sent through the CMT submission system below, selecting the PROPOR2026 Best Dissertations track:

PROPOR 2026 Best Dissertation Chairs

  • Marcos Garcia (Universidade de Santiago de Compostela)
  • Aline Paes (Universidade Federal Fluminense)

Scientific Committee

  • Alberto Abad (INESC-ID / IST)
  • Aline Vanin (UFCSPA)
  • Amália Mendes (Universidade de Lisboa)
  • Arnaldo Candido Junior (UNESP)
  • Daniela B. Claro (FORMAS-UFBA)
  • David Vilares (Universidade da Coruña)
  • Helena M. Caseli (Universidade Federal de São Carlos)
  • Ivandre Paraboni (Universidade de São Paulo)
  • Jorge Baptista (Universidade do Algarve & INESC-ID Lisboa)
  • Luís M. S. Gomes (FCUL)
  • Maria José Bocorny Finatto (UFRGS)
  • Magali S. Duran (Universidade de São Paulo)
  • Marcelo Finger (Universidade de São Paulo)
  • Marcos Fernandez-Pichel (Universidade de Santiago de Compostela)
  • Maria das Graças Volpe Nunes (NILC-USP)
  • Pablo Gamallo (Universidade de Santiago de Compostela)
  • Paulo Quaresma (Universidade de Évora)
  • Plinio A. Barbosa (UNICAMP)
  • Renata Vieira (CIDEHUS)
  • Ricardo Ribeiro (Iscte & INESC-ID)
  • Vladia C. M. Pinheiro (UNIFOR)

CALL FOR WORKSHOP PROPOSALS

The overall purpose of a workshop is to provide participants with the opportunity to present and discuss novel research ideas on active and emerging topics of computational processing of Portuguese, Galician and their variants.

Workshops can take on several forms including (but not limited to) being organized around emerging research areas, challenge problems and industrial applications. The organizers of approved workshops are required to announce the workshop and call for papers, gather submissions, conduct the reviewing process and decide upon the final workshop program. They must also prepare a set of workshop proceedings in an electronic version. They may choose to form organizing or program committees for assistance in these tasks.

The workshop organizers are responsible for:

  • Creating and distributing the call for papers, the call for participation, and any other relevant advertising. The calls should make it clear that at least one author of each accepted paper must attend the event, and that papers will be withdrawn if attendance is not secured through payment of the workshop fees. The calls should clearly describe the review and selection process and be framed to encourage as broad a participation as possible.
  • Creating and publishing, on time, a website with all the relevant information about the workshop.
  • Providing an extended abstract for the conference program.
  • Reviewing the submitted papers (at least two independent reviewers per paper) and selecting those to be accepted.
  • Scheduling the presentations within the workshop.
  • Securing any funding or sponsorship deemed necessary for invited speakers. The conference organization does not budget for free registration, accommodation, or travel expenses for the workshop organizers or their invited speakers.
  • Preparing the electronic proceedings volumes by March 27th, 2026.
  • Attending the workshop and selecting any required session chairs.

Important Dates

  • Workshop proposals due: Dec 1st, 2025
  • Notification of decision: Dec 15th, 2025
  • Workshop web pages available and CFP: Dec 20th, 2025
  • Deadline for paper submission: Feb 2nd, 2026
  • Notification to authors: Mar 10th, 2026
  • Camera-ready deadline: Mar 20th, 2026
  • Electronic proceedings volumes due: Mar 27th, 2026 (strict deadline)

Proposal Details

Proposals should be no more than three pages in length and must include:

  • Description of the workshop: title, abstract, objectives, relevance and its potential impact on the NLP community and society.
  • Motivation: why a PROPOR workshop on this topic is needed.
  • Description of the target audience
  • List of core committed program committee members (2 to 3 members).
  • Preliminary list of invited speakers (if any)
  • For workshops previously held at PROPOR or other conferences, details on venue, attendance, and number of submissions from previous years should be provided.
  • For new workshops, a list of potential attendees/submissions and/or a justification of the expected attendees and submissions.
  • Relevant experience of the organizing committee.
  • Duration of the workshop (full day or half day).
  • Contact information (address, email, and phone - WhatsApp) for all organizers.
  • A draft Call for Papers.
  • Designation of a main contact person.

Proposals should be submitted via e-mail to propor2026wtst@gmail.com

CALL FOR SHARED TASK PROPOSALS

Shared tasks must focus on NLP for Portuguese and/or Galician.

The shared task organizers are responsible for:

  • Creating and distributing the call for participation and any other relevant advertising. The calls should make it clear that at least one author of each participating system must attend the event, and should clearly describe the evaluation process.
  • Creating and publishing, on time, a website with all the relevant information about the shared task.
  • Providing an extended abstract for the conference program.
  • Evaluating the submitted results.
  • Covering their own costs. The conference organization does not budget for free registration, accommodation, or travel expenses for the shared task organizers.
  • Preparing the electronic proceedings volumes by March 27th, 2026.
  • Attending the conference.

Important Dates

  • Shared task proposals due: Dec 1st, 2025
  • Notification of decision: Dec 15th, 2025
  • Shared task web pages available and linked to the main site: Dec 20th, 2025
  • Camera-ready short papers for the systems participating in the shared task: Mar 25th, 2026
  • Expected Tutorial date: Apr 13th, 2026
  • Electronic proceedings volumes due: Mar 27th, 2026 (strict deadline)
  • Expected shared task meeting date: Apr 13th, 2026

Proposal Details

Proposals should be no more than three pages in length and must include:

  • Description of the shared task: a title and a brief description of the topic of the task and its potential impact on the NLP community and society.
  • Description of the target audience.
  • Description of the data sets that will be used in the shared task and their readiness.
  • Sketch of how the submitted systems will be evaluated.
  • Proposed timeline for the shared task, including in particular the dates for releasing trial, training, and test data, and the evaluation period.
  • Contact information (address, email, and phone - WhatsApp) for all organizers.
  • Designation of a main contact person.

Proposals should be submitted via e-mail to propor2026wtst@gmail.com

CALL FOR TUTORIAL PROPOSALS

Tutorials are intended to either provide a comprehensive introduction to core techniques/areas of interest or address advanced topics of language and speech processing including but not limited to the topics of the conference as stated in the call for papers. Especially encouraged are tutorials focusing on the computational processing of Portuguese, Galician and their variants.

Tutorial Speaker Responsibilities

Accepted tutorial speakers must provide an abstract of their tutorial for inclusion in the conference registration material. The abstract should be provided as plain ASCII text so that it can be included in email announcements and published on the conference website. Tutorial speakers must also provide tutorial materials, containing at least copies of the course slides and a bibliography for the material covered in the tutorial. Each tutorial will be granted one registration to the main conference.

Important Dates

  • Tutorial proposals due: Jan 8th, 2026
  • Notification of decision: Jan 18th, 2026
  • Tutorial descriptions to be included in the event web page: Jan 28th, 2026
  • Tutorial course material due: Mar 25th, 2026
  • Expected Tutorial date: Apr 13th, 2026

Submission Details

Proposals for tutorials should contain:

  • A title and a brief description of the tutorial content and its relevance to the PROPOR community (not more than 2 pages).
  • A brief outline of the tutorial structure showing that the tutorial's core content can be covered in a two- or three-hour slot (including a coffee break).
  • The names, affiliations, email addresses, and websites of the tutorial instructors, including a one-paragraph statement of their research interests and areas of expertise.
  • A list of previous venues and approximate audience sizes, if the same or a similar tutorial has been given elsewhere.
  • A description of special requirements for technical equipment (e.g., Internet access).

Proposals should be submitted via e-mail to propor2026wtst@gmail.com

PROPOR 2026: CALL FOR PAPERS ON NLP R&D IN THE INDUSTRY

As part of its effort to strengthen ties between academic research, industry, and the broader community, PROPOR includes an Industry Track dedicated to scientific work originating in industrial settings. The track welcomes research contributions that build on scientific advances, address applied scenarios, and report substantive results involving Portuguese and Galician.

The Industry Track is intended for complete research contributions, whether or not they are directly linked to deployed products, and may optionally include demonstrations to support the presentation of results. Submissions focused exclusively on product demonstrations should be directed to the Demo Track. However, joint submissions to both tracks are encouraged when work combines a strong research component with demonstrable artifacts.

The goal of the Industry Track is to showcase successful industry-driven research, highlight NLP impact in real-world contexts, and foster collaboration between academia and industry. Submissions may rely on proprietary or NDA-protected data, provided that sensitive information is appropriately anonymized and that motivation, methodology, and results are clearly described in line with PROPOR’s quality standards.

Submissions must be 4-page papers (English or Portuguese) describing objectives, methodology, data, and results. Accepted papers will receive one additional page to address reviewer feedback and will be presented in a dedicated Industry Track session. At least one author must be registered for the conference. Presentation details will be communicated upon acceptance.

Topics of interest:

The areas of interest include all topics related to theoretical and applied issues of written and spoken Portuguese and Galician, including, but not limited to, the same topics as the main conference call for papers:

  • Natural language processing tasks (e.g. parsing, word sense disambiguation, coreference resolution)
  • Natural language processing applications (e.g. question answering, subtitling, summarization, sentiment analysis)
  • Generative AI use cases that present innovation and research value focused on written/spoken Portuguese or Galician
  • Information extraction and information retrieval
  • Speech technologies (e.g. spoken language generation, speech and speaker recognition, spoken language understanding)
  • Speech applications (e.g. spoken language interfaces, dialogue systems, speech-to-speech translation)
  • Resources, standardization and evaluation (e.g. corpora, ontologies, lexicons, grammars)
  • NLP-oriented linguistic description or theoretical analysis
  • Distributional semantics and language modeling
  • Portuguese language varieties and dialect processing (including the language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia, Guinea-Bissau, Macau, Mozambique, Portugal, and Sao Tome and Principe)
  • Multilingual studies, methods, applications and resources including Portuguese/Galician

Important Dates (Updated!)

  • Industry track submission: Feb 13th, 2026 (23:59 GMT-12); previously Feb 6th, 2026 (23:59 GMT-12)
  • Notification of acceptance or rejection: Mar 9th, 2026; previously Mar 2nd, 2026
  • Camera-ready papers due: Mar 18th, 2026; previously Mar 15th, 2026
  • Conference: April 13th - 16th, 2026

Submissions

Submissions should consist of an anonymous paper up to four pages of content with one extra page for references. Authors must outline the main objectives of their projects, the methodology, and the results achieved, along with a brief discussion on the impacts of these results within the company and the discoveries made in NLP. Data sources that are under NDA or customer PII can be replaced by an anonymized sample of the data and a description of the content for reviewers to be able to evaluate work goals and results.

All submissions will be blind reviewed by at least 2 evaluators and notification of acceptance will be available in the designated period.

All submitted papers must conform to the official ACL style guidelines. ACL provides style files that meet these requirements. They can be found at:

Submissions must be written in Portuguese or English and submitted in PDF format. For the final version, authors of accepted papers will be given one extra content page to address the reviewers' suggestions. Note that demonstrations have their own track; please refer to the Demo Track call for its scope.

Publication

Accepted papers are expected to be published by ACL as a volume in the ACL Anthology as part of the PROPOR 2026 proceedings. They will be available online. To ensure publication, at least one author of each accepted paper must complete the appropriate registration for PROPOR 2026 by the early registration deadline.

Presentation format

Accepted papers will be presented orally at a designated Industry Track session, with an optional slide deck. Time will be set aside for the audience to give feedback and ask questions about each presentation. The Industry Track chairs will contact the main author after the notification of acceptance with more information on presentation format and duration. Internet access and a projector will be provided for the presentation.

Submissions must be sent through the CMT submission system below, selecting the PROPOR2026 Industry Track papers track:

Industry Track chairs

  • Clarissa Castellã Xavier (Instituto Federal do Rio Grande do Sul)
  • Henrico Bertini Brum (Sinch AB)

Scientific Committee

  • Beatriz Fagundes (Clio)
  • Fabio Rezende de Souza (USP)
  • Marcio Bigolin (IFRS)
  • Nataly Leopoldina Patti da Silva (SiDi)
  • Sidney Evaldo Leal (Venturus)
  • Vítor Rodrigues Tonon (UENP)

PROPOR 2026: CALL FOR DEMONSTRATIONS

The PROPOR 2026 Demonstration Track invites submissions presenting systems, tools, or products related to the computational processing of Portuguese and/or Galician. In line with previous editions, this track aims to foster interaction between academia and industry by offering a forum that goes beyond written or spoken descriptions of research. Demonstrations should enable attendees to engage with and test the systems during the dedicated demo session, which will provide an informal and interactive environment. Both early-stage research prototypes and mature, fully developed systems are welcome.

Topics of interest:

The areas of interest include all topics related to theoretical and applied issues of written and spoken Portuguese and Galician, such as, but not limited to:

  • Natural language processing tasks (e.g. parsing, word sense disambiguation, coreference resolution)
  • Natural language processing applications (e.g. question answering, subtitling, summarization, sentiment analysis)
  • Natural language generation
  • Information extraction and information retrieval
  • Speech technologies (e.g. spoken language generation, speech and speaker recognition, spoken language understanding)
  • Speech applications (e.g. spoken language interfaces, dialogue systems, speech-to-speech translation)
  • Resources, standardization and evaluation (e.g. corpora, ontologies, lexicons, grammars)
  • NLP-oriented linguistic description or theoretical analysis
  • Distributional semantics and language modeling
  • Portuguese language varieties and dialect processing (including the language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia, Guinea-Bissau, Macau, Mozambique, Portugal, and Sao Tome and Principe)
  • Multilingual studies, methods, applications and resources including Portuguese/Galician

The systems may be of the following kinds:

  • Natural Language Processing systems or system components
  • Application systems using language technology components
  • Software tools for computational linguistics research
  • Software for demonstration or evaluation
  • Development tools

Important Dates (Updated!)

  • Demos submission deadline: Feb 15th, 2026; previously Feb 6th, 2026 (23:59 GMT-12)
  • Notification of acceptance or rejection: Mar 9th, 2026; previously Mar 2nd, 2026
  • Camera-ready papers due: Mar 15th, 2026
  • Conference: April 13th - 16th, 2026

Submissions

Submissions should consist of a non-anonymous brief description document of up to three pages of content, including references. Developers must outline the main characteristics of their system/product/tool, provide sufficient details to allow its evaluation, and give information on how they plan to demonstrate it. Developers are encouraged to focus their description on the relevance of the computational processing component of Portuguese or Galician in the proposed system.

Submissions should be written in English or Portuguese.

At submission time, only PDF format is accepted. For the final version, authors of accepted papers will be given one extra content page to take the reviews into account. Authors of accepted papers will be requested to send the source files for the production of the proceedings.

All submitted papers must conform to the official ACL style guidelines. ACL provides style files that meet these requirements. They can be found at:

Publication

Accepted papers are expected to be published by ACL as a volume in ACL Anthology as part of the PROPOR 2026 proceedings. They will be available online. To ensure publication, at least one author of each accepted paper must complete an adequate registration for PROPOR 2026 by the early registration deadline.

Presentation format

Accepted demos will be presented at a designated demo session with an optional accompanying poster. Developers should make sure they can run their demos properly. Thus, it is the authors' responsibility to provide the necessary technical conditions (i.e. equipment) for the demo at the conference. Note that the local organizers will not provide any hardware or software. Free high-speed Internet access will be available.

There will be a best demo award for the best-presented project.

Further details on the date, time, and instructions of the demonstration session(s) will be determined and provided at a later date.

Submissions must be sent through the CMT submission system below, selecting the PROPOR2026 Demo Track papers track:

Demo Track chairs

  • Evandro Fonseca (Blip)
  • Susana Sotelo (Universidade de Santiago de Compostela)

Scientific Committee

  • Andre Carvalho (UFAM)
  • Bruno Souza Cabral (Escavador)
  • Daniela Schmidt (U Évora)
  • Jesus M. Benitez Baleato (USC)
  • José Ramom Pichel (imaxin.software, USC)
  • Livy Real (UFAM/ Jusbrasil)
  • Luis Trigo (UP)
  • Luiz Merschmann (UFLA)
  • Saullo Oliveira (PUC-Campinas)
  • Thiago Pardo (USP)

Workshops

This year, the following workshops will be co-located with PROPOR 2026.

First Workshop on Language Technologies for Health

Lang4Health

The First Workshop on Language Technologies for Health is a workshop dedicated to the development and application of Natural Language Processing (NLP) technologies in the healthcare field.

More Information on the Workshop Website

Third Student Research Workshop

SRW 2026

The Third Student Research Workshop is dedicated to providing an accessible, supportive, and high-quality forum for students from undergraduate to early-stage PhD to present and discuss their research.

More Information on the Workshop Website

Fourth Workshop on Digital Humanities and Natural Language Processing

DHandNLP 2026

The 4th Workshop on Digital Humanities and Natural Language Processing brings together researchers from the humanities and NLP, with a focus on work stemming from the humanities that deals with language.

More Information on the Workshop Website

Tutorial

In the 2026 edition of PROPOR, we are glad to announce that the following tutorial will be presented at the conference:

From Syntax to Semantics: Introducing UMR for NLP Annotation

Adriana S Pagano (UFMG), Magali Sanches Duran (USP), Federica Gamba (CUni)

Uniform Meaning Representation (UMR) is a cross-linguistic semantic representation framework designed to encode sentence meaning in a structured and interpretable way. Building on the foundations of Abstract Meaning Representation (AMR), UMR extends semantic coverage to events, participants, semantic roles, temporal/aspectual information, modality, and discourse links. It is language-agnostic and therefore suitable for multilingual exploration.

This tutorial provides a beginner's introduction to UMR aimed at an audience with no prior experience with AMR, UMR, or meaning representations. The tutorial begins with a simple introduction to the essentials of Universal Dependencies (UD) needed to understand how UMR graphs can be constructed from syntactic information. Using simple Portuguese examples, the tutorial illustrates how basic UD structures guide the creation of UMR graphs. Participants will leave with a foundational understanding of what UMR is; how it relates to syntax and semantic roles; how to create minimal UMR graphs; and how Portuguese UD treebanks can support UMR annotation.
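
As a rough illustration of the idea that UD structure can guide graph construction, the sketch below maps a hand-written UD analysis of a short Portuguese sentence to a graph in PENMAN-style notation (the bracketed format used by AMR). It is not taken from the tutorial materials: the token list, the ROLE_FOR_DEPREL mapping and the ud_to_penman function are assumptions made for this example, and real UMR annotation adds sense-tagged predicates, aspect, modality and document-level links that are omitted here.

```python
# Toy UD analysis of "O menino comeu a maçã" ("The boy ate the apple").
# Each token is (id, lemma, head, deprel); written by hand for illustration.
UD_TOKENS = [
    (1, "o", 2, "det"),
    (2, "menino", 3, "nsubj"),
    (3, "comer", 0, "root"),
    (4, "o", 5, "det"),
    (5, "maçã", 3, "obj"),
]

# Assumed, heavily simplified mapping from UD relations to role labels.
# It is only reasonable for a plain active transitive clause like the one above.
ROLE_FOR_DEPREL = {"nsubj": ":ARG0", "obj": ":ARG1"}


def ud_to_penman(tokens):
    """Build a one-line PENMAN-style graph from the root and its core arguments."""
    root = next(t for t in tokens if t[3] == "root")
    used = {}

    def var(lemma):
        # Variable = first letter of the lemma, numbered when the letter repeats.
        letter = lemma[0]
        used[letter] = used.get(letter, 0) + 1
        return letter if used[letter] == 1 else f"{letter}{used[letter]}"

    parts = [f"({var(root[1])} / {root[1]}"]
    for tok_id, lemma, head, deprel in tokens:
        role = ROLE_FOR_DEPREL.get(deprel)
        # Keep only direct dependents of the root that have a mapped role;
        # determiners and other function words are dropped.
        if head == root[0] and role is not None:
            parts.append(f"{role} ({var(lemma)} / {lemma})")
    return " ".join(parts) + ")"


print(ud_to_penman(UD_TOKENS))
```

In this toy mapping the subject and object become the two participants of the event, while determiners contribute no node of their own, which is the kind of syntax-to-semantics correspondence the tutorial abstract describes.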


Schedule

Program at a Glance
Parallel entries are listed in room order: Mercado Modelo 1, 2 and 3 on 13/04; Mercado Modelo 2 and 3 on 14/04, 15/04 and 16/04.
08:00 Registration
08:30 13/04: SRW, Lang4Health, DHandNLP; 14/04: Welcome Speech; 15/04: Registration; 16/04: Registration
09:00 14/04: Keynote speech: Maria das Graças Volpe Nunes; 15/04: Keynote speech: Margarida Petter; 16/04: Keynote speech: Jorge Baptista
10:10 14/04 to 16/04: Coffee Break
10:30 13/04: Coffee Break; 14/04: Technical Session 1, Technical Session 2; 15/04: Technical Session 9, Poster Session 1; 16/04: Technical Session 13, Best Dissertation Session
11:00 13/04: Panel: INCT-TILDIAR, Lang4Health, DHandNLP
12:00 13/04: Panel: INCT-TILDIAR, Lang4Health, DHandNLP; 14/04 to 16/04: Lunch Break
12:30 13/04: Lunch Break
14:00 13/04: Tutorial, Lang4Health, DHandNLP; 14/04: Technical Session 3, Technical Session 4; 15/04: Technical Session 10, Poster Session 2; 16/04: Technical Session 14, Industry and Demo Sessions
15:00 14/04 to 16/04: Coffee Break
15:30 13/04: Coffee Break; 14/04: Technical Session 5, Technical Session 6; 15/04: Technical Session 11, Poster Session 3; 16/04: Technical Session 15, Technical Session 16
16:00 13/04: Tutorial, Lang4Health, DHandNLP
16:30 16/04: Book Launch: BPLN new Edition
17:00 14/04: Technical Session 7, Technical Session 8; 15/04: Technical Session 12, Poster Session 4; 16/04: Community Meeting

14 April - PROPOR
Begin End Mercado Modelo 2 Mercado Modelo 3
08:00 08:30 Registration
08:30 09:00 Welcome Speech
09:00 10:10 Keynote Speech
O lugar do PLN em tempos de IA generativa
Maria das Graças Volpe Nunes
10:10 10:30 Coffee Break
10:30 12:00 Technical Session - TS1
Chair: António Branco
Technical Session - TS2
Chair: Adriana Pagano
Accelerating Portuguese Masked Diffusion Models through Representation Alignment
Adalberto Junior; Lucas Neves; Adriano Santana

AMALIA: An Open Source Large Language Model for European Portuguese
Afonso Simplício; Gonçalo Vinagre; Miguel Ramos; Diogo Tavares; Rafael Ferreira; Giuseppe Attanasio; Duarte Alves; Inês Calvo; Inês Vieira; Rui Guerra; James Furtado; Beatriz Canaverde; Iago Paulo; Vasco Ramos; Diogo Silva; Miguel Faria; Marcos Treviso; Daniel Gomes; Pedro Gomes; David Semedo; André Martins; João Magalhães

A Comparison of Methods to Bias Translation Toward Portuguese Variants
Catarina da Costa; Sebastian Padó

NLP-based Page Classification for Efficient LLM Extraction from Brazilian Public Tender Documents
Pedro Campos; Ivo de Medeiros; Adailton de Araújo

Biatron: A Parameter-Efficient Small Language Model for Brazilian Portuguese with Integrated Mathematical Reasoning
Daniel Fazzioni; Maria de Almeida; Anna Moreira; Anderson Soares; Sávio de Oliveira; Fernando Federson

Technical Session - TS2 (Mercado Modelo 3):
Semantic Representation of Relative Clauses in Lexicalized Abstract Meaning Representation
Jorge Baptista; Sónia Reis

Anatomy of Data Repositories for Analysis and Detection of Toxicity in Portuguese
Lorena Moreira; Paula Gibrim; Leonardo Rocha; Julio Reis

Síntese de Voz Emocional Multi-Idioma para Português Brasileiro: Uma Análise Comparativa de Abordagens de Ajuste Fino
Daniel Brito; Sidney Leal; Arnaldo Cândido Júnior

Libras-UFPel Corpus: A Parallel Dataset of Brazilian Sign Language and Portuguese for Multimodal Research and Processing
Antonielle Martins; Brenda Santana; Francielle Martins; Tatiana Lebedeff; Darley Nunes; Luisa Bohm

Analysis of Machine Translators on Sentences Generated by Portuguese Image Captioning Models
Natan Moura; João Gondim; Babacar Mane; Daniela Claro
Lunch Break
14:00 15:00 Technical Session - TS3
Chair: Graça Nunes
Technical Session - TS4
Chair: Jorge Baptista
Exploring Knowledge Graphs for Automatic Fake News Detection in Portuguese
Lucas Santos; Manoel Rodrigues Euclides Santos; Yuri Silva Souza; João Pedro Holanda Sousa; Roney Lira de Sales Santos

LexIris-pt and LexBert-pt: Specialized Sentence Embeddings for Legal Similarity in Brazilian Portuguese
Willgnner Ferreira Santos; João Gabriel Grandotto Viana; Antônio Pires de Castro Júnior; Fernando Ribeiro Trindade; Nádia Félix Felipe da Silva

Optimizing Efficiency in Multi-Stage Semantic Re-ranking Architectures
Sávio de Oliveira; Artur Novais; Fernando Federson; Anna Moreira; Maria Almeida; João Presa

Technical Session - TS4 (Mercado Modelo 3):
"Que ao mestre vai matá-lo?" The evolution of prepositional accusatives in Portuguese across time
Helena Rodrigues Menezes de Oliveira Vaz

Lexical and Orthographic Variation in Portuguese Financial Tweets: Annotation, Analysis, and Implications for Embedding Models
Ariani Di Felippo; Norton Trevisan Roman; Bryan Khelven Barbosa; Gabriela Pinheiro de Oliveira; Clarissa Lenina Scandarolli

Levados em Consideração: Uma Avaliação de Vieses de Estima por Raça, Gênero e Região em Grandes Modelos de Linguagem em Português Brasileiro
João Lucas de Melo; Marlo Souza
15:00 15:20 Coffee Break
15:20 17:00 Technical Session - TS5
Chair: Hugo Gonçalo Oliveira
Technical Session - TS6
Chair: Evandro Fonseca
Compression-based Language Complexity under Register Variation in Portuguese
Felipe Serras; Marcelo Finger

Evolução de Padrões Linguísticos na Escrita Científica em Português: Uma Análise com NILC-Metrix
Thiago Lobo; Claudia Martins

A Multilingual Voice Analytics Module for Contact-Center Hiring
Wagner Bombardelli; Vanessa Marquiafavel; Edgard Kuboo; Erica Missao

Automated Reformulation of Argumentative Essays to Improve Argument Organization and Development
Naomi Sutcliffe de Moraes; Denis Deratani Mauá

Topic Modeling in Brazilian Portuguese Documents on Antimicrobial Resistance
Enrique Susin; Lilian Berton

Technical Session - TS6 (Mercado Modelo 3):
ALBA: A European Portuguese Benchmark for Evaluating Language and Linguistic Dimensions in Generative LLMs
Inês Vieira; Inês Calvo; Iago Paulo; James Furtado; Rafael Ferreira; Diogo Tavares; Diogo Silva; David Semedo; João Magalhães

Certas Palavras: A 1980s-90s Brazilian Radio Corpus to Test TTS Models in Noisy Multi-Speaker Dialogues
Gustavo Araújo; Sidney Leal; Sandra Aluisio; Arnaldo Cândido Júnior; Moacir Ponti; Gustavo Lopes; Renato Silva; Miguel Júnior; Edresson Casanova; Adriana Santos

Combining Real and Synthetic Speech for ASR Adaptation in Brazilian Portuguese
Daniel Ribeiro da Silva; Maria Eduarda Silva Borba; Állan Christoffer Pereira Silva; Maria Carolina Silva Barreto; Arthur Fernandes de Morais; Paulo Victor dos Santos; Guilherme Dutra; Sávio Salvarino Teles de Oliveira; Anderson da Silva Soares

Bridging Citizens and Public Services: Improving Service Association with Retrieval-Augmented Generation (RAG) Labels
Ticiana Linhares Coelho da Silva; Celso França; Marcos André Gonçalves; Leonardo Rocha; Leonardo Alamy; Fernando Sola Pereira; Eduardo Soares de Paiva

Enhancing Brazilian Inflation Forecasts through Sentiment Analysis Using Large Language Models
Lucas Rezende; Cezio Ferreira Junior; Mateus Machado; Evandro Ruiz
17:00 18:00 Technical Session - TS7
Chair: Larissa Freitas
Technical Session - TS8
Chair: Sandra Aluísio
Experimental Evaluation of Topic Modeling Methods for Categorizing Irregularities in Health-related news
Alysson Guimarães; Methanias Colaço Junior; Samuel Almeida; Raphael Fontes

Retrieval-Augmented Generation with Small Language Models for Fake News Detection
Lucca Ferraz; Jhúlia Leal; Anderson Avila; Thiago Pardo; Fernando Batista; Renato Silva

Multi-Agent Architecture with RAG and Dynamic Context Windows for Text-to-SQL Optimization
Willgnner Ferreira Santos; Paulo Victor dos Santos; Marcella Scoczynski Ribeiro Martins; Larissa Freire Lekakis; Frederico Lemes Rosa; Bruno Matheus Costa; Miguel Alves Pereira Filho; Isabella Alves Montalvão

Technical Session - TS8 (Mercado Modelo 3):
A Multimodal Framework for Financial Fake News Detection for Brazilian Portuguese
José Vitor Souza Cardoso Requena; João Victor Assaoka; Lilian Berton

Specializing a Small Language Model for Closed-Domain Portuguese RAG using Knowledge Graph Supervision
Josué Caldas; Elvis de Souza; Patricia Silva; Marco Aurélio Pacheco

Retrieval-augmented generation and Knowledge Graphs in Portuguese-Language Legal Documents
Vinícius Oliveira; Deivison Oliveira; Mateus Souza; Maurício Lima; Sávio Oliveira; Thierson Rosa

15 April - PROPOR
Begin End Mercado Modelo 2 Mercado Modelo 3
08:00 09:00 Registration
09:00 10:10 Keynote Speech
Português: uma língua de passagem pela Europa, África, América do Sul e Ásia
Margarida Petter
10:10 10:30 Coffee Break
10:30 12:00 Technical Session - TS9
Chair: Vladia Pinheiro
Poster Session - PS1
Gender Identification in Brazilian Portuguese Product Reviews: A Comparative Study of Classical Models, BERT, and LLMs
Tiago de Melo; Carlos Maurício

Evaluating Small Language Models for English–Portuguese Translation: Impact of Model Scale and Quantization
Gustavo Tamiosso; Rafael Oleques Nunes; Dennis Giovani Balreira

Geological Text Summarization Using Generative Large Language Models
Matheus Stein de Aguiar; Rafael Oleques Nunes; Dennis Giovani Balreira

Rating–Text Mismatch in Brazilian Portuguese Reviews: How Reliable Are Zero-Shot LLMs?
Emanuelle Marreira; Tiago de Melo; Carlos Maurício

Can I guess where you are from? Modeling dialectal morphosyntactic similarities in Brazilian Portuguese
Manoel Siqueira; Raquel Freitag

Poster Session - PS1 (Mercado Modelo 3):
The Inadequacy of Automatic Evaluation Metrics in Question Answering: A Case-Study in Portuguese
Júlia da Rocha Junqueira; Viviane Moreira

Quando as Máquinas “Pensam”: Antropomorfização em Inteligência Artificial e Implicações para o Processamento de Linguagem Natural em Português
Anabela Barreiro

A Lexicon-Grammar of Brazilian Portuguese Predicative Adjectives
Ryan Martinez; Jorge Baptista; Oto Vale

CoDEl-BR: An Electoral Debate Corpus in Brazilian Portuguese
Alessandra Gomes; Aline Paes; Helena Caseli

Causal_QA.PT: A Human–LLM Co-Curated Benchmark for Causal Question Answering in Portuguese Language
Lia Furtado; Cíntia Araripe; Jocelani Castilhos; Lucas Holanda; Vladia Pinheiro

ConsumerBR: A Large-Scale Corpus of Consumer Complaints in Brazilian Portuguese
Luis Duarte; Pedro Giacomin; Vitória Bispo; Mariana Santos; Adriano Pereira; Gisele Pappa

CURUPIRA: Clever guard for linguistic prompt mitigation in Brazilian Portuguese
Rogério Sousa; William Castañeda; José Homeli; Marcellus Amadeus

Lost in Quantization: Disproportionate Degradation of Morphologically Rich Languages in INT8 vs. FP8 Inference
Guilherme Silva; Pedro Silva; Matheus Peixoto; Gladston Moreira; Eduardo Luz

Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs
Mariela Nina; Caio Veloso; Lilian Berton; Didier Vega

Structured Sentiment Analysis in Brazilian Portuguese: An Exploratory Study Using BERTimbau
Andrew Borges de Campos; Ulisses Brisolara Corrêa; Larissa Astrogildo de Freitas

Agent Orchestration - LLM for Legal Metadata Extraction: A Comparative Analysis of Efficiency and Precision
Luiz Anísio Batitucci; Luciane Lopes; Rhodie Ferreira; Emerson Paraiso

Semantic adapters in text-to-SQL for low-resource languages: the importance of semantic information
Anton Labate; Fabio Cozman

Lunch Break
14:00 15:00 Technical Session - TS10
Chair: Leonardo Zílio
Poster Session - PS2
Portuguese Sentiment Analysis with Open-Source LLMs: Models, Prompts, and Efficient Deployment
João Vitor Lima; Vládia Célia; Carlos Caminha

Data Augmentation for Named Entity Recognition in Domain-Specific Scenarios in Portuguese
Higor Moreira; Patricia Ferreira da Silva; Luciana Bencke; Viviane Moreira

Twenty Years of HAREM: A Reproducible Audit and Reassessment of Portuguese Named Entity Recognition
Rafael Oleques Nunes; Andre Spritzer; Carla Maria Dal Sasso Freitas; Dennis Giovani Balreira

Poster Session - PS2 (Mercado Modelo 3):
Modeling Linguistic Violence: An Ontology-Based Framework for the Computational Analysis of Violence Manifested in Language
Brenda Santana; Ana Fleischmann; Aline Vanin

Software for Automatic Speech Recognition via Whisper models applied to Oral History interviews in the Portuguese language
Edgleide Silva; Fernando Zagatti; Filipe Loyola Lopes; Anderson Dias Duarte; Rodrigo Bonacin; Angela Maria Alves

Uma Abordagem Híbrida para Predição de Faixa Etária de Autores de Textos Escritos na Língua Portuguesa
Luiz Merschmann; Alice Ribeiro

Discovery of Legal Patterns in Civil Petitions via LLM-Based Fact Extraction and Density Clustering
Rhedson Esashika; Carlos M. S. Figueiredo; Tiago de Melo

Sintomas Linguísticos: Geração Aumentada por Recuperação e Raciocínio em LLMs sob a Variação Português-Inglês em Contextos Médicos
Guilherme Vianna de Moura; Gabriel Assis; Aline Paes

Diálogos Tóxicos: Gatilhos e Padrões de Interação no Reddit Brasileiro
Giovana Piorino; Marco Antônio de Alcântara Machado; Luiz Henrique Quevedo Lima; Adriana Pagano; Ana Paula Couto da Silva

Automatic Speech Recognition for Child Reading: A Phonemic Approach using Isolated Words in Brazilian Portuguese
Aline Rodrigues; Carlos Ribeiro

Geração de consultas SPARQL a partir de linguagem natural
Heber Xavier de Castro; Clever Ricardo Guareis de Farias

Democratizing Legal Analytics: Resource-Efficient Information Extraction for Brazilian Case Law
Rodrigo Dornelles

Cartas Indígenas ao Brasil: Classificação Multi-Rótulo
Caio Almeida; Renata Vieira; Débora Abdalla

Identificação de notícias falsas em português: um olhar sobre a generalização de modelos
Raphael Guedes; Bruno Barros; Hugo do Nascimento

Avaliação Automática de Redações do ENEM: Uma Análise Comparativa entre Engenharia de Características e Transformers
Pâmela Chalegre; Vitor Machado; Valéria Feltrim

Avaliação End-to-End de um Sistema RAG para Documentos Hospitalares em Português
Murilo Vargas da Cunha; Marília Rosa Silveira; César Brasil Sperb; Brenda Salenave Santana; Larissa Astrogildo Freitas; Ulisses Brisolara Corrêa

Field of Science and Technology Classification of Academic Documents in Portuguese
Ivo Simões; Hugo Gonçalo Oliveira; João Correia

dialect2vec: Um método baseado em vetores para transcrição dialetal do português a partir de questionários do ALiB
Laila Mota; Daniela Claro; Rerisson Cavalcante; Eloize Seno
15:00 15:20 Coffee Break
15:20 17:00 Technical Session - TS11
Chair: Nádia Silva
Poster Session - PS3
Extending an Ensemble Baseline with Corpus-Based Graph Features for Portuguese Pun Detection
Avelar de Sousa; Camilla Sousa; Carlos Henrique Barros; Rafael Anchiêta

Uso de técnicas de Aprendizado de Máquina e Modelos de Língua de Larga Escala para avaliação automática de textos do exame Celpe-Bras
Rafael Oleques Nunes; Bernardo Cobalchini Zietolie; Ricardo Zanini De Costa; Rodrigo Brock da Silva; João Victor Piardi Pacheco; Rafaela Dall'Agnol da Rocha; Dennis Giovani Balreira; Elisa Marchioro Stumpf; Juliana Roquele Schoffen

A Multitask Transformer for Offensive Language Detection and Target Identification in HateBR
Guilherme Silva; Pedro Silva; Matheus Peixoto; Gladston Moreira; Eduardo Luz

Evaluating Reference-Free Summarization Quality Metrics for Portuguese: A Study with Human Judgments in Financial News
João Victor Assaoka Ribeiro; Thomas Pires Correia; José Vitor Souza Cardoso Requena; Lilian Berton

Language Effects in Text-to-SQL Across English and Portuguese
Lucas Nobre; Suele Sousa; Savio Oliveira; Anderson Soares

Poster Session - PS3 (Mercado Modelo 3):
The Superficiality Bias: Community Votes and Answer Utility in Portuguese Health Question Answering
Carlos Henrique Santos Barros; Gustavo Figueredo Rodrigues de Sousa; Rogério Figueredo de Sousa

FlexQwen: Exploring Hybrid Objectives and Text Originality for Portuguese
Miguel Carpi; Marcelo Finger

Improving Machine Translation of Idioms: A Spanish–Galician Parallel Dataset and Synthetic Augmentation Approach
Lúa Santamaría Montesinos; Saúl Buján; Daniel Bardanca; Pablo Gamallo

Detecting Stuttering with Artificial Intelligence: A Hybrid Method for Brazilian Portuguese
Rubens Buzzeti; Paula Buzzeti; Roney Santos

Automatic Metrical Scansion of Galician poetry: First Results
Pablo Ruiz Fabo; Pauline Moreau; Anxo Alonso Pérez

Dependency Distance Effects on Eye-Tracking Measures in Brazilian Portuguese
Diego Alves

AspectRAG: Uma Arquitetura de Recuperação e Geração para Análise de Sentimentos Baseada em Aspectos
Erick Ribeiro; Andre Carvalho; Rhedson Esashika

Prompt Engineering for Named Entity Extraction from Portuguese Legal Documents
Giovanni Medeiros; Catarina Silva; Hugo Gonçalo Oliveira

Token-Level Pun Location Using Multi-Layer BERT with Mixture of Experts
Rafael Anchiêta; Roney Santos; Raimundo Moura

Unsupervised Evaluation of Explanations for Hate Speech Classification in Portuguese
Isabel Carvalho; Hugo Gonçalo Oliveira; Catarina Silva

Análise de Sentimento Baseada em Aspectos no Domínio de Acomodações Utilizando o modelo BERTimbau
Franco Pereira; Larissa Freitas; Ulisses Correa

CLARIN-PT-LDB: An Open LLM Leaderboard for Portuguese to assess Language, Culture and Civility
João Ricardo Silva; Luís Gomes; António Branco
17:00 18:00 Technical Session - TS12
Chair: Roney Lira
Poster Session - PS4
Leveraging political alignment information for stance detection
Matheus Pavan; Ivandre Paraboni

NorBERTo: A ModernBERT Model Trained for Portuguese with 331 Billion Tokens Corpus
Lucas Pellicer; Guilherme Rinaldo

Auditing the Evaluators: How Far Can Automatic Evaluation Go in Assessing Portuguese Financial Texts?
Marina de Souza Masid; Gabriel Assis; Daniela Vianna; Aline Paes; Altigran Soares da Silva

Poster Session - PS4 (Mercado Modelo 3):
Portho: A Corpus-Based Resource of Orthographic Neighbors in European Portuguese
Eugénio Ribeiro; David Antunes; Nuno Mamede; Jorge Baptista

Math-PT: A Math Reasoning Benchmark for European and Brazilian Portuguese
Tiago Teixeira; Ana Carolina Erthal; Juan Belieni de Castro Araujo; Beatriz Estevens Canaverde; Miguel Faria; Diego Mesquita; Eliezer de Souza da Silva; André F. T. Martins

To Describe or Not to Describe? Benchmarking Database Representations for Schema Linking in Text-to-SQL
Daiane Kreitlow; Hilário Oliveira

Evaluating Automated Scoring Models on Official ENEM Essays
Laís Nuto Rossman; Igor Cataneo Silveira; Denis Deratani Mauá

Analyzing Debate Dynamics in the Portuguese Parliament with Dialogue Action Flows
Patrícia Ferreira; Ana Alves; Catarina Silva; Hugo Gonçalo Oliveira

Avaliação Automática de Redações do ENEM: Um Estudo Empírico sobre Representações Linguísticas e Contextuais
Gabriel Matos; Valéria Feltrim

Neuro-symbolic Approaches for Rubric-Based Automatic Essay Evaluation of ENEM Essays
Igor Cataneo Silveira; Denis Deratani Mauá

NormaTex-MapSNOMED: Bridging the Gap Between Brazilian Portuguese Clinical Narratives and SNOMED CT
Isabela Araujo; Layslla Martinez; Claudia Moro

Think Portuguese with Bode Reasoning
Gabriel Garcia; André Schuck; João Renato Manesco; Pedro Henrique Paiola; Leandro Passos; João Paulo Papa

A UD Parser to the Rescue: A Method for Bringing a Classical Annotated Corpus to Life Again
Lucelene Lopes; Magali Duran; Thiago Pardo

LegalSim-PT: Building a Dataset for Legal Document Simplification in Portuguese Leveraging Linguistic Metrics
Arthur Scalercio; Maria Finatto; Aline Paes

Retrato_Cantado: Criação e Análise de um Corpus para Representações de Identidade em Letras de Músicas Brasileiras
Vitória Firmino; Janaina Lopes; Bruno Nogueira; Valéria Reis

RacismoBR: A Manually Annotated Dataset for Racist Discourse Detection in Brazilian Portuguese
João Vítor Vaz; Marcos André Gonçalves; Fabricio Benevenuto

LARI Dataset: A Native Dataset for Context-Aware Question Answering in Portuguese
Júlia da Rocha Junqueira; Larissa de Freitas; Ulisses Corrêa

Viés de gênero na tradução automática: uma avaliação no par inglês-português
Tayane Soares; Yohan Gumiel; Rafael Junqueira; Tácio Gomes; Adriana Pagano

16 April - PROPOR
Begin End Mercado Modelo 2 Mercado Modelo 3
09:00 10:10 Keynote Speech
Abstract Meaning Representation: New Rosetta Stones for the Babel Entanglement
Jorge Baptista
10:10 10:30 Coffee Break
10:30 12:00 Technical Session - TS13
Chair: Viviane Moreira
Best Dissertation
Enhanced Universal Dependencies in the Wild: Evaluating Portuguese EUD Parsing in Realistic Scenarios
Elvis de Souza; Thiago Pardo

UlyssesLegalNER-Br: from Legislative to Legal, a comprehensive corpus of Brazilian legal documents for named entity recognition
Hidelberg Albuquerque; Ellen Pereira; Danilo Lucena; Heldon Albuquerque; Nadia Silva; Marcio Dias; Rafael Nunes; Adiano Oliveira; André Carlos Carvalho

The PROPOR Ecosystem: Structure, Roles, and Evolution of Portuguese-Language NLP
Rafael Oleques Nunes; Gustavo Lopes Tamiosso; Pedro Lucas Castro de Andrade; Matheus Stein de Aguiar; Rafael Pereira de Gouveia; Higor Moreira; Bruno Tavares; Laura Pereira de Gouveia; Felipe Soares Fagundes Paula; Andre Spritzer; Dennis Giovani Balreira; Joel Luís Carbonera; Hidelberg O. Albuquerque; Nádia Félix Felipe da Silva; Ellen Polliana Ramos Souza Pereira

Automatic Question classification in Portuguese: A Large-Scale Dataset and Comparative Evaluation of Classification Strategies
Murilo Boccardo; Valéria Feltrim

BIPA: Brazilian Portuguese Phonetic Dataset with Dialectal Variations in IPA Standard
Thiago de Sousa; Lucas Gris; Nádia Félix

Best Dissertation (Mercado Modelo 3):
Automated Essay Scoring for Brazilian Portuguese. Evidence from Cross-Prompt Evaluation of ENEM Essays
Andre Barbosa; Denis Mauá

Evaluation of Grammatical Patterns in Transformers for Brazilian Portuguese: A Syntactic Analysis Based on Attention Heads
Ricardo Gomes; Daniela Claro; Rerisson Cavalcante

Socially Responsible and Explainable Automated Fact-Checking and Hate Speech Detection
Francielle Vargas; Fabrício Benevenuto; Thiago Pardo

Evolving Open Information Extraction for Portuguese Employing Language Models
Bruno Cabral; Daniela Claro; Marlo Souza
Lunch Break
14:00 15:00 Technical Session - TS14
Chair: Thiago Pardo
Industry and Demo
Analysing LLMs for spelling normalization of 18th century Portuguese
Helena Cameron; Aline Paes; Fernanda Olival; Renata Vieira

Marcação de correferência para a caracterização de personagens em obras literárias em português
Diana Santos; Luisa Mara Silva Lima; Emanoel Pires

Global vs. Local Sentence Embeddings for Brazilian Portuguese: Revisiting Monolingual Models in the Age of Foundation Models
Matheus Peixoto; Guilherme Silva; Giacomo Figueredo; Pedro Silva; Eduardo Luz

Industry and Demo (Mercado Modelo 3):
To be Announced
15:00 15:30 Coffee Break
15:30 16:30 Technical Session - TS15
Chair: Clarissa Xavier
Technical Session - TS16
Chair: Helena Caseli
Contrastive and Adversarial Disentanglement for Speaker Representations in Brazilian Portuguese
Ariadne Matos; Arnaldo Cândido Júnior; Moacir Ponti

Negation-Aware Data Augmentation for Portuguese Natural Language Inference
Maria Cecília Corrêa; Felipe Paula; Matheus Westhelle; Viviane Moreira

JabuticaBERT: Modern Portuguese Encoders from Scratch with RTD and Long-Context Training
Thiago Porto; Gabriel Gomes; Alexandre Bender; Ulisses Corrêa; Larissa Freitas; William Cruz; Marcellus Amadeus

Technical Session - TS16 (Mercado Modelo 3):
Structured Summaries for Retrieval-Augmented Generation in Portuguese-Language Consumer Complaints
Rafael Sant'Ana; Pedro Garcia; Luis Duarte; Mariana Santos; Adriano Pereira; Gisele Pappa

Exploring Sentiment Analysis Approaches in a Public Agency Security News Dataset
Thiago Lobo; Claudia Martins

Formalising the DATASUS RTS: An Ontological Model for an RDF Knowledge Graph
Vitor Fuentes Ferreira Pires; Dalvan Griebler; Felipe Meneguzzi
16:30 17:00 Book Launch: BPLN - Natural Language Processing 4th Edition
17:00 18:00 PROPOR Community Meeting

Conference Dinner

The Conference Dinner will be held at the Bargaço Restaurant.

Restaurante Bargaço

Restaurante Bargaço is one of the city's most traditional restaurants for Bahian regional food, operating since 1971. It is located at R. Antônio da Silva Coelho, Quadra 43 - Lote 18 - Jardim Armação, Salvador.

Tickets to the conference dinner can be bought through the ECOS registration system until 13/04 and directly at the conference until 14/04. The number of tickets is limited, and the price of extra tickets may vary.

Dinner Menu

  • Entrée: Acarajé, shrimp fried pastel with catupiry cheese, cod and fish fritters, and toast.
  • Main Dish: Shrimp and fish moqueca or shrimp and fish stew, grilled fish with vegetables, shrimp bobó, grilled filet mignon or chicken fillet and salad.
  • Vegetarian Option: Vegetable Moqueca or vegetable stew
  • Side dishes: Rice, farofa (toasted cassava flour), vinaigrette, pirão (cassava and fish porridge), french fries.
  • Dessert: Pie, pudding, and seasonal fruits.
  • Drinks: Mineral water, coconut water, juice, soda, and beer.

Registration

Registration fees

Early Bird (payment until Mar 06, 2026; extended from Mar 01)
  • Professional Non SBC member: R$ 1.800,00
  • Professional SBC Member: R$ 1.550,00
  • Student Non SBC member: R$ 450,00
  • Student SBC Member: R$ 400,00
  • Workshops and Tutorial-only: R$ 600,00
Regular Registration (payment from Mar 07, 2026 until Mar 23, 2026; start date moved from Mar 02)
  • Professional Non SBC member: R$ 2.200,00
  • Professional SBC Member: R$ 1.800,00
  • Student Non SBC member: R$ 500,00
  • Student SBC Member: R$ 450,00
  • Workshops and Tutorial-only: R$ 600,00
Late Registration (payment from Mar 23, 2026)
  • Professional Non SBC member: R$ 2.500,00
  • Professional SBC Member: R$ 2.100,00
  • Student Non SBC member: R$ 600,00
  • Student SBC Member: R$ 500,00
  • Workshops and Tutorial-only: R$ 600,00

Participation

Professional and Student Registrations cover participation in all days of PROPOR 2026. Workshops and Tutorial-only registrations cover participation on the first day of the conference, i.e., Apr 13, 2026.

Publication

To ensure publication, at least one author of each accepted paper must complete a full, i.e. Professional, registration for PROPOR 2026 by the early registration deadline (for main track) or Regular Registration deadline (for workshops and other tracks). Each Professional registration can cover up to 2 (two) papers in the main track, or 1 (one) paper in the main conference and any number in the workshops, demo and industry tracks. Additional papers in the main conference may be included with a Main Conference Additional Paper Fee (2 papers in the main conference per fee). Additional papers in workshops, demo, or industry tracks (any number of them) can be included with the Workshop Additional Paper Fee.

Workshop registrations cover the publication of any number of papers in the workshops, and Student Registrations cover any number of papers in the Student Research Workshop.

Conference Dinner

The Conference Dinner will be held at the Bargaço Restaurant. There are 100 available tickets at the price of R$150,00, which can be bought through the ECOS registration system or directly at the conference.

Please consult the Conference Dinner section for more information.

How to register

Registrations can be made through the ECOS management system.

Student Support

Student support for participation in PROPOR 2026

The call for student grants for those wishing to participate in PROPOR 2026 in Salvador is now open.

Application period: Apr 7th to Apr 10th

Registration waiver and/or financial aid of R$ 400,00.

Application form: https://forms.gle/PcGTjHhmhoQkBwg1A

All undergraduate and graduate students, whether currently registered for PROPOR 2026 or not, are eligible to apply. Registration waivers are available for those not yet registered, and financial aid is available for students residing outside of Salvador, with priority given to those in a position of social vulnerability.

All financial support for this call has been provided by: "Artificial Intelligence Journal: FUNDING OPPORTUNITIES for PROMOTING AI RESEARCH"

Organization

PROPOR 2026 is organized by the Federal University of Bahia (UFBA) and the Brazilian Computer Society (SBC).

Organizing Committee

  • General Chairs

    • Marlo Souza - Universidade Federal da Bahia
    • Iria de-Dios-Flores - Universidade Pompeu Fabra - Barcelona
  • Program Chairs

    • Diana Santos - Universitetet i Oslo
    • Larissa Freitas - Universidade Federal de Pelotas
  • Editorial Chairs

    • Jackson Wilke da Cruz Souza - Universidade Federal da Bahia
    • Eugénio Ribeiro - INESC-ID
  • Workshops and Tutorial Chairs

    • Roney Lira de Sales Santos - Universidade Federal da Bahia
    • Renata Vieira - Universidade de Évora
  • Best Dissertation Chairs

    • Marcos Garcia - Universidade de Santiago de Compostela
    • Aline Paes - Universidade Federal Fluminense
  • Demo Chairs

    • Evandro Fonseca - Blip
    • Susana Sotelo - Universidade de Santiago de Compostela
  • Industry Track Chairs

    • Clarissa Xavier - Instituto Federal do Rio Grande do Sul
    • Henrico Brum - Sinch AB

Local Organization

  • Marlo Souza
  • Daniela Barreiro Claro
  • Jackson Wilke da Cruz Souza
  • Lilian Teixeira
  • Rerisson Cavalcante
  • Robespierre Pita
  • Roney Lira de Sales Santos

Event Venue

Event venue location info and gallery

Salvador

Salvador, also known as São Salvador da Bahia de Todos os Santos (English: Savior; Saint Savior from the Bay of All Saints), is the capital of the Brazilian state of Bahia. With 2.9 million people (2017), it is the largest city proper in the Northeast Region and the 4th largest city proper in the country, after São Paulo, Rio de Janeiro and Brasília.

Founded by the Portuguese in 1549 as the first capital of Brazil, Salvador is one of the oldest colonial cities in the Americas. A sharp escarpment divides its Lower Town (Cidade Baixa) from its Upper Town (Cidade Alta) by some 85 meters (279 ft). The Elevador Lacerda, Brazil's first elevator, has connected the two since 1873. The Pelourinho district of the upper town, still home to many examples of Portuguese colonial architecture and historical monuments, was named a World Heritage Site by UNESCO in 1985.

Salvador was the first slave port in the Americas, and the African influence of the slaves' descendants on many cultural aspects of the city makes it a center of Afro-Brazilian culture. The city is noted for its cuisine, music, dance and architecture. Porto da Barra Beach in Barra has been named one of the best beaches in the world. Itaipava Arena Fonte Nova hosted the city's games during the 2013 Confederations Cup and the 2014 World Cup in Brazil.

Salvador forms the heart of the Recôncavo, Bahia's rich agricultural and industrial maritime district, and continues to be a major Brazilian port. Its metropolitan area, home to 3,899,533 people (2018), is the wealthiest in Brazil's Northeast Region (2015).

The Deputado Luís Eduardo Magalhães Airport connects Salvador with all major Brazilian cities and also operates several international flights.


Conference Location

PROPOR 2026 will be held at the Mercure Hotel, located in Rio Vermelho, one of the most famous neighborhoods in Salvador (Bahia - Brazil). Besides its wonderful beaches (Paciência and Buracão), the neighborhood is known for its many pubs and restaurants, making it a comfortable place to try the traditional cuisine and local culture of Salvador.

Tourism and Accommodations

The PROPOR organization has partnered with the tourism and accommodation company EMPORIO DE TURISMO to offer PROPOR participants discounts on hotels and tourist attractions in Salvador.

The tourism agency has made an exclusive website available to PROPOR participants, with options for hotels and tours.

Interested participants may also contact the company by telephone (+55 71 3342-7410 / +55 71 99601-2957 (WhatsApp)) or by email (atendimento@emporiodeturismo.com.br).

Information on tourist attractions and hotel recommendations will be provided soon.

Restaurants and Bars

The event venue lies in a central area with many bars and restaurants for the participants to gather during social events and meetings. The address of each restaurant links to Google Maps for easy navigation.

Restaurants close to the event venue (sorted by distance to the event venue)

Nana.Veg

Description: Vegan restaurant with a variety of options.
Distance: 2.7 km
R. João Gomes, 9 - 1 Andar - Rio Vermelho.
Price: $$

Dona Mariquita - Cozinha Patrimonial da Bahia

Description: Good regional cuisine with a stylized ambiance.
Distance: 3.3 km
R. do Meio, 178 - Rio Vermelho.
Price: $$

Manjericão Restaurante Natural

Description: Natural food restaurant with a good selection of vegetarian dishes.
Distance: 3.6 km
R. da Fonte do Boi, 3B - Rio Vermelho.
Price: $$

Shanti

Description: Pan-Asian restaurant serving dishes from a different country each day (vegetarian options).
Distance: 2.7 km
R. João Gomes, 10 - Rio Vermelho.
Price: $$

Restaurante Di Liana

Description: Italian cuisine focused on pasta and steak.
Distance: 1 km
R. Macapá, 314 - Ondina.
Price: $$$

Restaurante Sertão e Mar

Description: Self-service restaurant with a variety of seafood and local cuisine.
Distance: 1 km
Tv. Baependi, 69 - Ondina.
Price: $

Restaurante Meu Chef

Description: Brazilian cuisine.
Distance: 1 km
Tv. Macapá, 66 - Ondina.
Price: $$

Stalo restaurante

Description: Self-service restaurant.
Distance: 1.1 km
R. Baependi, 152 - Ondina.
Price: $$

Barravento

Description: Seaside restaurant with a beautiful coastal view, serving regional dishes, seafood and meat. It also functions as a bar, serving drinks and beers.
Distance: 3.1 km
Av. Oceânica, 814 - Barra.
Price: $$$

Saúde Brasil

Description: Natural food restaurant with a good selection of vegetarian and vegan dishes.
Distance: 3.1 km
R. Humberto de Campos, 6 - Graça.
Price: $$

Support

PROPOR 2026 is made possible by the support of the following partners.

Communication

Address

Instituto de Computação - UFBA
Av. Milton Santos, s/n, Campus Universitário de Ondina