Refer to github.com/kasra-hosseini for all open-source projects.

Selected Projects

  • MapReader
    An end-to-end computer-vision pipeline for the semantic exploration and analysis of images at scale. Applied transfer learning to pretrained vision models (CNNs, vision transformers) to classify ~16k historical maps (~30M geospatial patches). Winner of the Roy Rosenzweig Prize; adopted externally for plant phenotyping.
    SIGSPATIAL · JOSS · DH Awards 1st Runner-Up
  • DeezyMatch
    A flexible approach to fuzzy string matching and candidate ranking using deep learning and contrastive learning.
    EMNLP 2020
  • histLM
    Neural language models (word2vec, fastText, BERT, Flair) for historical research, trained on ~5B tokens of nineteenth-century English. Models available on Hugging Face and Zenodo.
    JOHD 2021
  • scivision
    A toolkit for scientific image analysis, connecting computer vision models to scientific imagery. Contributor.
  • obspyDMT
    A Python toolbox for retrieving, processing, and managing large seismological datasets.
    Solid Earth 2017
  • SubMachine · homepage
    Web-based tools for exploring 3-D seismic tomography and other models of Earth's deep interior. 750k+ queries served as of March 2026.
    G3 2018 · Front Cover Image · Top 20 most read
  • privgem
    Privacy-preserving generative models (GANs) for synthetic data generation.
  • GeoTree
    Nearest-neighbour searches on geographic coordinates using KDTree and BallTree.

Contributions

QUIPP-pipeline (privacy-preserving synthetic data generation) · PressPicker (interactive visualisation for picking newspaper titles) · daedalus (dynamic spatial microsimulation pipeline for population projections) · AxiSEM (spectral-element wave propagation) · obspy (Python framework for seismological data) · MC Kernel (sensitivity kernels via Monte-Carlo integration)