publications

2025

  1. ACL
    Truth Knows No Language: Evaluating Truthfulness Beyond English
    Blanca Calvo Figueras, Eneko Sagarzazu, Julen Etxaniz, and 4 more authors
    2025
  2. EMNLP
    Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque
    Oscar Sainz, Naiara Perez, Julen Etxaniz, and 9 more authors
    2025
  3. COLING
    HiTZ at VarDial 2025 NorSID: Overcoming Data Scarcity with Language Transfer and Automatic Data Annotation
    Jaione Bengoetxea, Mikel Zubillaga, Ekhi Azurmendi, and 4 more authors
    2025
  4. arXiv
    Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque
    Lukas Arana, Julen Etxaniz, Ander Salaberria, and 1 more author
    2025
  5. arXiv
    BERnaT: Basque Encoders for Representing Natural Textual Diversity
    Ekhi Azurmendi, Joseba Fernandez Landa, Jaione Bengoetxea, and 5 more authors
    2025
  6. EACL
    BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
    Jaap Jumelet, Abdellah Fourtassi, Akari Haga, and 9 more authors
    2025
  7. arXiv
    Challenging the Abilities of Large Language Models in Italian: a Community Initiative
    Malvina Nissim, Danilo Croce, Viviana Patti, and 8 more authors
    2025

2024

  1. ACL
    Latxa: An Open Language Model and Evaluation Suite for Basque
    Julen Etxaniz, Oscar Sainz, Naiara Perez, and 6 more authors
    2024
  2. NeurIPS
    BertaQA: How Much Do Language Models Know About Local Culture?
    Julen Etxaniz, Gorka Azkune, Aitor Soroa, and 2 more authors
    2024
  3. NAACL
    Do Multilingual Language Models Think Better in English?
    Julen Etxaniz, Gorka Azkune, Aitor Soroa, and 2 more authors
    2024
  4. NAACL
    XNLIeu: a dataset for cross-lingual NLI in Basque
    Maite Heredia, Julen Etxaniz, Muitze Zulaika, and 3 more authors
    2024
  5. arXiv
    Lessons from the Trenches on Reproducible Evaluation of Language Models
    Stella Biderman, Hailey Schoelkopf, Lintang Sutawika, and 27 more authors
    2024
  6. SEPLN
    IKER-GAITU: research on language technology for Basque and other low-resource languages
    Eneko Agirre, Itziar Aldabe, Xabier Arregi, and 8 more authors
    2024
  7. CLiC-it
    GITA4CALAMITA - Evaluating the Physical Commonsense Understanding of Italian LLMs in a Multi-layered Approach: A CALAMITA Challenge
    Giulia Pensa, Ekhi Azurmendi, Julen Etxaniz, and 2 more authors
    2024
  8. EKAIA
    Latxa Euskarazko Hizkuntza-Eredua
    Naiara Perez, Julen Etxaniz, Oscar Sainz, and 7 more authors
    EKAIA EHUko Zientzia eta Teknologia aldizkaria, 2024

2023

  1. EMNLP
    NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark
    Oscar Sainz, Jon Ander Campos, Iker García-Ferrero, and 3 more authors
    2023
  2. MSc
    Grounding Language Models for Compositional and Spatial Reasoning
    Julen Etxaniz, Oier Lopez Lacalle, and Aitor Soroa
    2023

2021

  1. BSc
    ProMeta: softwarearen garapenerako prozesuen definizio eta ezarpenerako sistema metaereduetan oinarrituta
    Julen Etxaniz
    2021