Understanding the neural mechanisms underlying cognition and perception is a fundamental pursuit in neuroscience. This thesis addresses the challenge of de-coding brain activity to interpret and reconstruct human cognitive processes using functional magnetic resonancei maging (fMRI). By lever aging advancements in deep learning, we develop robust pipelines for decoding various sensory and cognitive modalities, including vision, language, music, and video. Central to our approach is the alignment of brain representations with computational models, assuming brain representations can be mapped on vectorial spaces of pretraining multimodal models, enabling seamless mappings between neural data and external stimuli. Through the integration of encoding and decoding models, we explore tasks ranging from image reconstruction and cross-modal decoding to language generation and video retrieval. Furthermore,this thesis delves into the concept of "brain algebra", examining how neural representations adhere to composition a land transformational principles ak into vector spaces. By perturbing brain activity in this high-dimensional semantic space, we uncover insightsin to the brain’s capacity for concept manipulation and compositionality. This work highlights the synergy between neuroscience and artificial intelligence, showcasing how multimodal, data-driven approaches can deepen our understanding of brain function and pave the way for innovative applications in brain-computer interfaces and cognitive modeling.

Multimodal brain decoding through deep learning / Matteo Ferrante , 2025 Jun 03. 37. ciclo, Anno Accademico 2024/2025.

Multimodal brain decoding through deep learning

FERRANTE, MATTEO
2025-06-03

Abstract

Understanding the neural mechanisms underlying cognition and perception is a fundamental pursuit in neuroscience. This thesis addresses the challenge of de-coding brain activity to interpret and reconstruct human cognitive processes using functional magnetic resonancei maging (fMRI). By lever aging advancements in deep learning, we develop robust pipelines for decoding various sensory and cognitive modalities, including vision, language, music, and video. Central to our approach is the alignment of brain representations with computational models, assuming brain representations can be mapped on vectorial spaces of pretraining multimodal models, enabling seamless mappings between neural data and external stimuli. Through the integration of encoding and decoding models, we explore tasks ranging from image reconstruction and cross-modal decoding to language generation and video retrieval. Furthermore,this thesis delves into the concept of "brain algebra", examining how neural representations adhere to composition a land transformational principles ak into vector spaces. By perturbing brain activity in this high-dimensional semantic space, we uncover insightsin to the brain’s capacity for concept manipulation and compositionality. This work highlights the synergy between neuroscience and artificial intelligence, showcasing how multimodal, data-driven approaches can deepen our understanding of brain function and pave the way for innovative applications in brain-computer interfaces and cognitive modeling.
3-giu-2025
Multimodal brain decoding through deep learning / Matteo Ferrante , 2025 Jun 03. 37. ciclo, Anno Accademico 2024/2025.
File in questo prodotto:
File Dimensione Formato  
13.Ferrante.thesis.37.pdf

accesso aperto

Tipologia: Tesi di dottorato
Licenza: Creative commons
Dimensione 135.19 MB
Formato Adobe PDF
135.19 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12610/95265
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact