laion_fmri.captions¶

Access per-stimulus captions for the LAION-fMRI images.

Each stimulus carries a small set of short human captions (collected on CloudResearch Connect). Shared non-OOD stimuli additionally carry one AI caption (a GPT-generated description). The target is:

shared images (seen by every participant) get 5 human captions and, for non-OOD images, 1 AI caption
unique images (one participant only) get 3 human captions and no AI caption
OOD images get their target human captions and no AI caption

Together they give you a small set of independent natural-language descriptions per image, useful for caption-conditioned modelling, retrieval, or quick qualitative checks.

Files on disk:

stimuli/
  task-images_desc-captions.csv

The CSV is long-form (one row per caption) with columns:

`image_name`	Stimulus filename. Join key against `task-images_metadata.csv`.
`caption_idx`	Position within the image. Rank `1` is the highest-quality human caption; ranks go up to `3` for unique images and up to `5` for shared images. The AI caption (if any) gets `0`.
`source`	`"human"` or `"ai"`.
`caption`	The caption text.
`origin_collection`	Which collection the caption came from (CloudResearch Connect batch labels for humans, model name like `"gpt-5.1"` for AI).
`participant_id`	CloudResearch Connect participant identifier (NaN for AI).
`ai_model`	Model name (NaN for human captions).

All images have their target human-caption count. AI captions are provided for shared non-OOD images only.

You normally reach Captions through the Stimuli hub:

>>> import laion_fmri
>>> stim = laion_fmri.load_stimuli()
>>> stim.captions.human("shared_12rep_LAION_cluster_1003_i0.jpg")
['a hand with light pink painted nails with flower designs',
 'A hand with finger painted nails with flowers in them',
 ...]
>>> stim.captions.ai("shared_12rep_LAION_cluster_1003_i0.jpg")
'A hand with short, pale pink polished nails features delicate floral nail art on two fingers.'

For a single row-level DataFrame of every caption attached to an image:

>>> stim.captions.get("shared_12rep_LAION_cluster_1003_i0.jpg")

Classes

Captions([data_dir])

Lazy reader for the per-stimulus captions CSV.

class laion_fmri.captions.Captions(data_dir=None)[source]¶

Bases: object

Lazy reader for the per-stimulus captions CSV.

Loads the CSV on first access and caches a per-image lookup.

Parameters:: data_dir (str or Path, optional) – Override the configured data directory. Defaults to laion_fmri.config.get_data_dir().
Raises:: FileNotFoundError – If stimuli/task-images_desc-captions.csv is not present. Captions are a public stimulus-side metadata file; run laion-fmri download-captions (or laion_fmri.download.download_captions()) to fetch them.

ai(image_name: str) → str | None[source]¶

AI caption for image_name, or None if not available.

AI captions are present for shared non-OOD images only.

get(image_name: str) → DataFrame[source]¶

Return all captions for one image as a DataFrame.

Returns an empty DataFrame (not an error) when the image has no captions. Rows are ordered by caption_idx: AI first (idx=0), then humans in rank order.

human(image_name: str, limit: int | None = None) → list[str][source]¶

Human captions for image_name in rank order.

Parameters:

image_name (str)
limit (int, optional) – Cap to the top-limit captions. None (default) returns all available (currently up to five).
captions. (Returns an empty list if the image has no human)

images() → list[str][source]¶: Image names that have at least one caption.

list(image_name: str, source: str | None = None) → list[str][source]¶

Captions for image_name as a list of strings.

Parameters:

image_name (str)
source ({"human", "ai"}, optional) – Restrict to one source. None (default) returns all available captions in caption_idx order.

property metadata: DataFrame¶

The captions CSV as a DataFrame (one row per caption).

Columns: image_name, caption_idx, source, caption, origin_collection, participant_id, ai_model.