Tuning / tweaking Finto AI to give more creative or less-apparent subject headings suggestions #24

@tpalonen

Description

  1. Currently, many of the subject heading suggestions given by Finto AI are terms that occur in the text itself. As an alternative, it would be interesting to have Finto AI give suggestions that sit slightly above or below the text surface: suggestions that appear more creative, derivative or interpretative. It would be nice to have a kind of knob in the user interface that lets the user choose between very literal results, more creative results, or something in between. Technically, the more creative results could perhaps be produced by giving MLLM and/or Bonsai more weight in the process.
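
To make the knob idea a bit more concrete, here is a minimal Python sketch. It assumes two score sources per document: one that favours terms found in the text and one that draws on learned associations. The function name `blend_suggestions`, the `creativity` parameter and the example scores are all hypothetical and are not part of Finto AI or Annif; which backend would play which role is left open.

```python
from typing import Dict, List, Tuple


def blend_suggestions(
    literal: Dict[str, float],
    associative: Dict[str, float],
    creativity: float,
    limit: int = 10,
) -> List[Tuple[str, float]]:
    """Blend two score dictionaries with a single 'creativity' knob.

    creativity = 0.0 uses only the literal scores,
    creativity = 1.0 uses only the associative scores.
    All names here are hypothetical, for illustration only.
    """
    if not 0.0 <= creativity <= 1.0:
        raise ValueError("creativity must be between 0.0 and 1.0")

    subjects = set(literal) | set(associative)
    blended = {
        subject: (1.0 - creativity) * literal.get(subject, 0.0)
        + creativity * associative.get(subject, 0.0)
        for subject in subjects
    }
    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(blended.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:limit]


# Made-up example scores, not real Finto AI output.
literal_scores = {"history": 0.9, "archives": 0.4}
associative_scores = {"collective memory": 0.8, "history": 0.3}

for knob in (0.0, 0.5, 1.0):
    print(knob, blend_suggestions(literal_scores, associative_scores, knob, limit=3))
```

A single weight keeps the user interface simple: a slider value maps directly to `creativity`, and the extremes reproduce each source unchanged.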

===========================

  2. As an addition to the above, or as a separate suggestion: it would be interesting to have an option to filter out the most commonly used subject headings from the suggestions. For example, "history" might be one of the most commonly used subject headings in the humanities. If the text in question is a doctoral thesis in history, the subject heading "history" would be a very unsurprising result and therefore not a very helpful one, particularly if the user is looking for more creative suggestions. The challenge here is context, as the most commonly used subject headings are domain-specific. Nevertheless, it might be interesting to see what the results would look like if a multidomain dataset, such as Melinda, could be used to identify and filter out the most common subject headings. The process could be semi-automated, i.e. the headings would first be identified statistically from a dataset and then reviewed by an expert to see which ones are truly and globally generic and which ones just happen to stand out in the given dataset. Alternatively, Finto AI could perhaps have several domain-specific lists of unwanted subject headings. Finto AI could then first identify the text's domain and then apply that domain's list to exclude the respective subject headings from the suggestions.
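
The semi-automated filtering described above could be sketched roughly as follows. This is only an illustration under stated assumptions: the records are plain lists of subject headings, the 50 % threshold is arbitrary, and the names `common_subjects`, `filter_suggestions` and the `stoplists` mapping are hypothetical rather than existing Finto AI or Annif features. The domain detection step is not shown and is assumed to happen elsewhere.

```python
from collections import Counter
from typing import Dict, Iterable, List, Set, Tuple


def common_subjects(records: List[Iterable[str]], min_share: float) -> Set[str]:
    """Return headings that occur in at least `min_share` of the records."""
    counts: Counter = Counter()
    for subjects in records:
        counts.update(set(subjects))  # count each heading once per record
    total = len(records)
    if total == 0:
        return set()
    return {subject for subject, n in counts.items() if n / total >= min_share}


def filter_suggestions(
    suggestions: List[Tuple[str, float]],
    stoplists: Dict[str, Set[str]],
    domain: str,
) -> List[Tuple[str, float]]:
    """Drop suggestions that are on the stoplist of the given domain."""
    blocked = stoplists.get(domain, set())
    return [(subject, score) for subject, score in suggestions if subject not in blocked]


# Step 1: statistical candidates from a (made-up) dataset.
records = [["history", "war"], ["history", "art"], ["history"], ["biology"]]
candidates = common_subjects(records, min_share=0.5)

# Step 2: an expert keeps only the candidates that are truly generic.
expert_approved = {"history"}
stoplists = {"humanities": candidates & expert_approved}

# Step 3: apply the stoplist of the detected domain to the suggestions.
suggestions = [("history", 0.9), ("oral history", 0.5)]
print(filter_suggestions(suggestions, stoplists, "humanities"))
```

Keeping one stoplist per domain means a heading that is generic in one field can still be suggested in another, which addresses the context problem raised in the suggestion.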
