IUU+DB: Tracking Illegal, Unreported, and Unregulated Fishing, Seafood Fraud, and Labor Abuse
through LLM-driven Information Extraction
Abstract.
Illegal, unreported, and unregulated fishing (IUU) traditionally refers to fishing activities that violate applicable laws or occur in areas that lack applicable laws. We propose the term IUU+ to capture a broader suite of fisheries sector environmental and associated supply chain trade-related crimes and behaviors. Although IUU+ activity is widely recognized as a serious threat to marine ecosystems, markets, and livelihoods, a quantitative understanding of these incidents, e.g., their frequency, geography, species, actors, and patterns in the type of illicit activity, remains difficult to obtain. We propose IUU+DB, a large language model (LLM)–driven system for building a global incident database of IUU+ activity. The system ingests heterogeneous documents, classifies whether they describe relevant incidents, extracts key data elements such as actors, locations, species, vessels, violations, and enforcement outcomes, and supports deduplication and trend analysis. Case studies and validation results show that IUU+DB can help organize fragmented evidence, surface geographic and behavioral hotspots, support fisheries-domain specific research in academia and non-government organizations, assist source and species risk assessments for industry, and provide support for policy implementation and targeted enforcement efforts to government agencies.
1. Introduction
Illegal, unreported and unregulated fishing (IUU) undermines sustainable fisheries management which relies on capping harvests, limiting the use of destructive gears, limiting (or minimizing) harvest of juveniles, and avoiding fishing in sensitive ecosystems, among other regulations. While it is difficult to quantify the full extent of illegal fishing, a 2009 estimate of global losses from IUU fishing amounted to $10 bn and $23.5 bn annually, representing between 11 and 26 million metric tons (Agnew et al., 2009).
While IUU fishing encompasses a broad set of activities, there is a broader set of associated or analogous activities across the seafood sector, including illegal aquaculture, mislabeling, labor abuses, and trade prohibitions and sanctions evasion. We therefore offer a more inclusive term (IUU+) to capture the broader suite of human, environmental, and associated supply chain-related crimes and behaviors. IUU+ behaviors threaten both people and nature, and undermine key aspects of the UN Sustainable Development Goals (Auld et al., 2023). The consequences of IUU+ behaviors and their associated crimes are far-reaching: destruction of marine and freshwater ecosystems, loss of human livelihoods and socioeconomic security, safety and human rights of fishing communities, loss of local and national revenues, and unfair competition to legitimate fishermen and businesses (Shaver and Yozell, 2018). And the impacts of IUU+ behaviors extend beyond the fisheries sector as they have been linked to transnational organized crime, illicit financial trade such as trade-based money laundering, and circumvention of trade policies such as tariffs, sanctions, and import prohibitions (May-Mavrellis, 2017).
To deter, detect, and combat IUU+ behaviors and associated trade, governments have responded with a range of regulatory initiatives that span from on-the-water fisheries monitoring, control, and surveillance (MCS) procedures, to port controls (e.g., Port State Measures Agreement, PSMA), to international trade and import control schemes (e.g., U.S. Seafood Import Monitoring Program, and the European Union’s IUU Fishing Regulation) (UN FAO, 2026b, a; NOAA Fisheries, 2025; European Commission, 2026). In addition, members of the seafood industry partnered with international organizations to develop a framework for interoperable catch and supply chain data standards through the Global Dialogue on Seafood Traceability (GDST) (GDST, 2026). Many of these programs rely on the collection and transfer of key data elements (KDEs) across supply chain nodes such as vessel identity, harvest location, species, harvest weight, fishing authorization number, and chain-of-custody information (Blaha et al., 2023; EU IUU Coalition, 2025; Hosch and Clarke, 2025).
Despite widespread consensus that IUU+ fishing and related crimes occur globally, there does not exist a comprehensive incident database that systematically catalogs incidents across jurisdictions, source types, and categories of illicit behavior. Some information is reported in the news media while others are scattered across government enforcement actions (e.g., arrests, case filings) and still others are in detailed academic publications and NGO reports. The lack of such a comprehensive incident database makes it difficult to obtain a snapshot of fisheries crime, emerging trends and patterns, and where to invest limited surveillance and enforcement resources.
To address these needs, we developed IUU+DB, a global incident database for illegal fishing, seafood fraud, labor abuse, and related illicit activities. Our goal is to leverage advances in LLM-driven information extraction technology to capture incidents, their attributes, taxonomies of misconduct, and relevant KDEs. Such a resource could help provide near-real-time evidence of IUU+ behaviors and how these behaviors are expressed, differentiated, and connected across species and geographies. Such a system would support consistent analysis and provide actionable evidence for researchers and policymakers. For example, under the U.S.’ import control regulation (SIMP), a recent proposal by the overseeing regulatory agency offers the potential to have a two-tier system for seafood import KDE reporting requirements where the agency could periodically review and adjust the species list for each tier based on risk analysis, moving species between tiers as needed (NOAA Fisheries, 2024). For such a system to work, information and data related to risks of IUU fishing, seafood fraud, and potential labor abuses would need to be tracked in near-real time through a comprehensive incident database such as the one described here (Simeone, 2026).
While LLMs have made it easier to rapidly prototype systems such as is presented here, reliable extraction of incidents remains challenging. Source documents can be alternatively scientific or colloquial, considerably lengthy, feature nuanced distinction between IUU+ categories, and contribute a range of potential KDEs. Deduplication (i.e., knowing whether multiple extractions refer to the same incident) is crucial in event reconciliation. While there have been efforts to catalog information on illegal wildlife trade broadly, most work to-date has been focused on enumerating the range of scientific taxonomy involved and is often limited to a specific geographic region. Additionally, these datasets do not contain insights on the underlying crime or illicit behavior committed thus making it difficult to use these datasets to assist in building intervention methods (Stringham et al., 2021; Weissgold, 2024; Marshall et al., 2025). Database efforts that try to capture multiple dimensions about an incident and the underlying crime are presently manually curated (TRAFFIC, 2026; Belhabib, 2026). There is a need for a scalable non-manual way to catalog and classify incidents.
Our contributions are:
-
(1)
We present IUU+DB, an LLM-driven information extraction framework that automatically discovers, extracts, and classifies structured incident data from heterogeneous sources including news articles, academic papers, NGO documents, and government reports.
-
(2)
Through IUU+DB, we are now able to impute global trends in illegal fishing, across more than 140 countries.
-
(3)
Finally, we show through case studies how IUU+DB can provide actionable insights for regulators, streamlining seafood supply chain risk assessments.
2. Background and Related Work
IUU+ fishing and associated trade includes, but is not limited to, activities such as harvesting that violates national or international laws (illegal), fishing that avoids required reporting or monitoring (unreported), fishing that occurs in areas or under conditions where effective management and enforcement do not exist (unregulated), theft or other illicit fish farm operations (illegal aquaculture), species substitution and mislabeling (fraud), forced labor (labor abuses), and tariff evasion (trade barrier and sanctions evasion).
Tracking and quantifying IUU+ activity is a difficult task (UN FAO, 2023). No comprehensive way to track IUU+ behaviors and associated trade presently exists. Prior research on tracking IUU+ is scattered and is either specific to IUU+ behavior-type (Salas et al., 2026; Luque and Donlan, 2019; Temple et al., 2022) or is specific to a particular geography or species (Glaser et al., 2019; WWF, 2014).
Central to how we organize our information classification and extraction is a granular taxonomy of IUU+ behaviors and KDEs. To enumerate IUU+ behaviors, we surveyed the literature and developed a list of 41 behaviors that fall within the seven themes enumerated above. Regarding KDEs, we conducted a literature review on seafood-specific supply chains to identify KDEs that are either already established or are proposed in the literature to fill identified information gaps. We developed 14 groups of KDEs, which, taken together have over 100 KDE fields (see Appendix A).
LLM-Driven Information Extraction
The transition from pretrained language models such as BERT to LLMs, has enabled an increase in the possible complexity of data extraction as the problems move from strictly extractive to generative (Zhang et al., 2025). However, these generative IE methods have notable issues when it comes to reliability and reproducibility (Dathe et al., 2026). To account for this, earlier work has looked at ways to limit hallucination through schema and grounded ontologies (Hodgson et al., 2026; Li et al., 2025). Other approaches extract the exact supporting context alongside the generative IE to evaluate grounding against after the extraction (Spillias et al., 2025), although this can be costly to verify.
3. Methodology
Functionally, we set the minimum information threshold for what qualifies as an incident of IUU+ fishing and associated trade to the common data elements used in crime script analysis: who (actors), what (methods), when (time), and where (location). To ensure broad coverage of IUU+ incidents, we designed our information extraction pipeline around five primary sources: (i) Government documents, such as memorandums and court documentation, (ii) Non-Governmental Organizational reports and press releases, (iii) News articles, (iv) Academic Papers, and (v) Industry journals. These five sources span a variety of input lengths and styles, and we also hypothesized that they will have differing foci of IUU+ incidents.
3.1. Framework Design
IUU+DB is an end-to-end pipeline for turning unstructured documents into structured incident records. It ingests documents, classifies them by source type, and extracts relevant information into a standard output schema. The pipeline is guided by an input schema that defines the document categories to recognize and the key data elements (KDEs) to extract. Fig. 1 gives an overview of the IUU+DB pipeline. Based on the KDEs, IUU+ types, and behaviors identified by our team outlined in Appendix A, we classify each source into one of three main source types: (i) sources that describe one or multiple incidents, (ii) sources that are related to IUU+ fishing but do not pertain to an incident, and (iii) sources unrelated to IUU+ fishing. For each incident, the system extracts information from 14 KDE groups covering 100 possible fields, classified by 7 possible IUU+ types, and 41 behaviors. For sources classified as related to IUU+ but that do not describe a specific incident, the system extracts a set of 3 KDEs. To manage the large extraction task possible, the pipeline first aims to ascertain which KDE groups are present in a document and then extracts fields from only those groups. This effectively reduces the schema information the LLM needs to hold in context at any given moment.
3.2. Document Pre-Processing Module
Document Pre-Processing Module is the first step of IUU+DB and accepts plain text, URLs, and PDFs. For PDFs we use Pytesseract 111https://pypi.org/project/pytesseract/ to perform Optical Character Recognition (OCR) to render documents ingested into plain text. For URLs we first attempt to extract text data with Playwright 222https://playwright.dev/python/; if that fails, we use an LLM to extract the core text. Once the text is extracted, the system removes formatting artifacts, checks for duplicate documents using a text hash, and skips documents that have already been processed. Finally, IUU+DB splits the cleaned text into smaller chunks, embeds them using Qwen3-VL-Embedding-2B 333https://huggingface.co/Qwen/Qwen3-VL-Embedding-2B, and stores them in a vector database for later semantic search. We chose this Qwen model as it supports a token length of up to 32k which is more than enough and supports input that includes a mixture of text and image data, so in the future we can incorporate embedded figures from sources. This step can be parallelized so that large document collections can be processed efficiently.
3.3. Source Classification Module
The Source Classification Module determines what kind of information a document contains. Given a document and a set of possible source types, the module first identifies passages that appear relevant to IUU+ activity.
Our framework uses few shot prompting to classify the document as one of the following: a source describing an IUU+ incident, a source related to IUU+ fishing but with either more general information or not enough sufficient information provided to be considered a specific incident, or a source unrelated to IUU+ fishing. Because a single document may describe more than one incident, this module also extracts and stores the passages associated with each incident. This helps the KDE extraction module process each incident separately and reduces confusion in later stages.
3.4. KDE Extraction Module
The KDE Extraction Module is the workhorse of the IUU+DB system. It takes the source document, scope classification, as well as optionally relevant passages and a list of KDE groups. The extractor then makes runs where is the number of relevant passages, or if the classifier did not include them. In each run the extractor first determines the presence of each KDE group given before running the extraction of each KDE fields within the group. If the KDE groups were not given the module defaults to extracting every field. For our implementation we again utilized an LLM with DSPy444https://dspy.ai/. DSPy helps us define a structured schema for extraction and encourages logical consistency across fields. For example, if the system identifies a seafood product, we expect it to also identify the species from which that product is made. From the three source classification, we identify two schema to extract: one for incidents, and another albeit much smaller schema for related to IUU+ fishing which tracks species, countries, and companies mentioned. We do not capture any additional information from unrelated articles. In our experience, we found that some KDEs are relatively rare, and unlikely to occur together within the same incident (e.g. It would be uncommon for there to be a single IUU+ incident as we have described them, to include data about both catch AND aquaculture.) We grouped KDEs therefore by those most likely to co occur with one another and extract them in the modules runs. After extracting the KDEs we then classify the IUU+ behavior exhibited in the incident.
3.5. Deduplication and Trend Identification
After extraction, the incidents are aggregated in two ways, first by merging duplicate incidents and secondly by grouping related incidents into trends. The De-Duplication and Trend Identification modules first identifies similar incidents using cosign similarity against their sources, we then check whether the duplicate candidates have matching identification information such as vessel name, or id and matching event or enforcement dates. If the two incidents pass those filters and have high cosign similarity, the module merges the extracted KDEs. In the cases of conflicts, the module queries all attached sources for the relevant KDE field in order to get a consensus.
3.6. Prompt Optimization
Using DSPy’s prompt optimizers and manually validated data we automate the iterative task of prompt fine tuning. we utilized MIPROv2 (Opsahl-Ong et al., 2024) to iteratively finetune the prompt and field descriptions themselves. For our metric we assigned rewards for having a correctly filled KDE field with high cosine similarity between the extraction and the validated field. To penalize hallucinations we negatively reinforce incorrect extractions.
3.7. Datasets
We collected data for this project in two primary ways, directly from APIs and from websites via webscrapers. We used the following APIs: NewsAPI (NewsAPI.ai, 2026), SerpAPI (SerpAPI.com, 2025), and CORE API (Knoth et al., 2023). And also targeted the following websites via webscrapers: MongaBay (MongaBay, 2026), US DOJ (US DOJ, 2026), Oceana (Oceana, 2026), US NOAA Fisheries (NOAA Fisheries, 2026), and Undercurrent News (Undercurrent News, 2026a). To create queries for candidate source discovery, we used the previously defined IUU+ types and behaviors to build queries that would capture the wide range of behaviors (see Appendix A).
For webscraping, we separated websites into two groups by how specific the sites were to the fish-seafood sector. For more general sites, e,g., MongaBay and DOJ, we searched for the following key words: ”illegal fishing”, ”IUU fishing”, ”overfishing”, ”fishing violation”, ”illegal catch”, ”seafood fraud”, ”forced labor fishing”, ”illegal aquaculture”, and ”illegal seafood sanctions”. In all these searches, our aim was to capture any relevant articles capturing the illicit behavior as well as narrow the scope to only fishing and associated trade. Since the second group of sites focus solely on the fish-seafood sector (e.g., nongovernmental organization (NGO) websites, NOAA Fisheries, and Undercurrent News), it was the case that if the article referred to an infraction it was highly likely to be about IUU+ behaviors. As such, we selected the following keywords: ”violence”, ”investigation”, ”coercion”, ”arrest”, ”charge”, ”indict”, ”fined”, ”enforce”.
With the three APIs we can use more complex logic and selected the following query: ”IUU or seafood mislabeling or ((Transhipment or sanctions or unfree labor or workplace violations or wage theft) and (ship or seafood or fish) and (arrest or criminal investigation or indictment or search or seizure or fine))”. From NewsAPI, instead of using keywords, the query was made up of ”concepts” which utilize a semantic searching mechanism allowing us to pull articles from different languages, and related articles that may not use the exact key words.
3.8. Web Interface
In addition to exposing APIs for general use and further analysis, we developed a web interface to help subject-matter experts analyze, edit, and audit the data. The interface has five main pages. The Insights page shows statistics and figures from the database. The Sources page allows users to read and explore the documents that have been ingested. The Incidents and Related to IUU+ pages allow users to review the KDEs extracted under each schema. The Upload page allows administrative users to add new documents to the database from plain text, URLs, or PDFs. The Sources, Incidents, and Related to IUU+ pages also include version history, so administrators can track changes made to each record (see fig 2).
4. Experimental Results
IUU+DB ingested articles over 11 years from 2,472 sources across 143 different countries yielding a total of 8,435 incidents. Our evaluation is focused on addressing the following questions:
-
•
Can IUU+DB be used by SMEs to identify global trends in illegal fishing? (§4.1)
-
•
Does the distribution of KDE extractions follow our expectations of IUU+ behavior types? (§4.2)
-
•
Can biclustering on (KDE fields, Behavior types) reveal insights into reporting patterns? (§4.2)
-
•
Can IUU+DB correctly identify and extract IUU+ incidents and KDEs? (§4.4)
-
•
What are the most surprising incidents tracked? (§4.3)
-
•
Does IUU+DB improve extraction/classification tasks vs baseline LLMs like GPT-4o-Mini? (§4.4)
-
•
Does IUU+DB demonstrate improved performance vs. newer/more powerful models such as GPT-5.4? (§4.5)
4.1. Hotspot Analysis
We find that the data set and IUU+DB is capturing a wide scope of incidents, from high profile cases (AFP, 2022; Undercurrent News, 2025), to regional conflicts (Firstpost.com, 2023; Island.lk, 2023; Undercurrent News, 2024), to local news (Patrick Gouldsbrough, 2022; Environment Agency of the United Kingdom, 2024).
From Fig. 3 we can see that while English speaking nations are highly represented, the dataset has captured events from every continent and nearly every coastal state. The two largest trends we extracted were Illegal Red Snapper harvesting in the Gulf of Mexico and Sri Lankan-Indian fishing rights conflicts.
| Country | Incident |
|---|---|
| Brazil | Murder of Bruno Pereira & Dom Philips (AFP, 2022) |
| UK | Illegal Spear Fishing (Patrick Gouldsbrough, 2022) |
| Alaska, USA | North Star Fishing Co. Obstruction (Undercurrent News, 2026b) |
| Gulf of Mexico, USA | Illegal Red Snapper Harvesting (Undercurrent News, 2024) |
| Nova Scotia | Illegal Eel-Elving Harvesting (Keith Doucette, 2023) |
| British Columbia | 30k Pounds of Tuna Seized (Fisheries and Oceans Canada, 2023) |
| Sri Lanka | Navy Arrests 14 Indian Fishermen (Firstpost.com, 2023) |
| Philippines | Poaching in Marine Reserve (Benjie B. Talisic, Tito P. Tan, 2022) |
4.2. KDE Prevalence Analysis
For a given incident, IUU+DB extracts roughly 20% of the fields in the full schema; see Fig. 4. This is not surprising given the size of the schema and the kinds of articles that make up most of the database. We next asked whether IUU+DB fills the fields we would expect for each IUU+ classification. For this purpose, we first analyzed the KDE Presence detection module, which decides whether the system should search for fields within a given KDE group. We then measured KDE group presence across IUU+ types. Figures 4 and 5 show that some KDE groups, such as event and compliance information, appear across many IUU+ types. However, behavior-specific KDEs are much more common in the categories where they are expected. For example, labor standards and crew information increase from 7% and 16% across all cases to 81% and 50% in labor abuse cases. Similarly, aquaculture information increases from 3% across all cases to 70% in aquaculture-related cases.
The second analysis we used biclustering to find coordinated occurrences of KDEs and IUU+ sub behaviors. We use the FP-Max algorithm 111https://rasbt.github.io/mlxtend/user_guide/frequent_patterns/fpmax/ wherein each transaction is modeled as a KDE with one or more IUU+ subtypes. To account for data imbalance, we ran the biclustering in two stages. The first stage was conducted with a minimum support threshold of 0.25 (which yields primarily transactions of IUU+ type of ”Illegal Fishing”), and the second with a minimum support threshold of 0.1 after filtering out transactions from the first step. The maximum set bicluster we found was made up of the following 10 KDEs all from the Event group: event and enforcement country, event and enforcement location, and event and enforcement location category, enforcement category, primary offender, event date, and resolution. This cluster was supported by 45% of transactions and by 36 unique IUU+ sub-behaviors.
In the second run, there were four clusters each with 10 KDEs, with the most common being the made up of the same 10 KDEs from the first run. The other three were the same as the set above, less event date and with a different KDE instead that is highly relevant to a specific IUU+ behavior or type. Such as, species or product information for Species Fraud and Mislabeling, or crew information information in cases of Labor Abuse.
4.3. Maximum Entropy Analysis
We aimed to determine the most surprising illegal fishing incident that occurred within our dataset. For this purpose, we employed a maximum entropy approach. We described each event in terms of an (enforcement country, Illegal fishing sub behavior) tuple. We learn a maximum entropy distribution using iterative proportional fitting (IPF). For each quarter we use incidents from the past year (i.e., past four quarters) to populate the seed matrix which IFP uses to estimate the current quarter. For each cell in the country/sub behavior matrix we then calculated the difference between the estimated and observed, and render it as a z-score. Using to define a surprising event, we discover 46 surprising country-behavior events, the top 6 seen in fig 2. Examples of these identified surprising incidents are the USA Q2 2022, and LKA 2024 Q3, the first of which involved two Florida men fishing prior to the season starting and catching undersized fish (Undercurrent News, 2022), and the second was a case of falsifying tracking data in Saya de Malha, northeast of Madagascar (Ian Urbina, 2024).
| Country | Qtr | IUU+ Sub Behavior | Z Score |
|---|---|---|---|
| USA | 2022Q2 | Keeping Undersized fish | 26.6 |
| China | 2022Q2 | Falsifying Documents | 25.4 |
| Sri Lanka | 2024Q3 | Falsifying Documents | 17.3 |
| United Kingdom | 2023Q4 | Keeping Undersized fish | 14.2 |
| Ireland | 2022Q2 | Falsifying Documents | 10.4 |
| United Kingdom | 2022Q2 | Prohibited Fishing Gear | 10.3 |
4.4. Evaluation
For our evaluation set, we had three validators manually label a random selection of 103 source documents, 52 of which were initially classified by IUU+DB as incidents, 25 were related to IUU+ but not involving an incident, and 26 unrelated to IUU+. The validators read through the document and determined the source scope. If the validator found the document to be an incident, they proceeded to classify the incident from the 7 IUU+ types and 41 sub behaviors. Finally the validators extracted the 100 KDEs and enter them into IUU+DB. We log any differences and update the database with the human labeled data.
We first evaluate classification performance on the manually validated test set. We report precision, recall, and F1 score for document scope, IUU+ type, and IUU+ sub-behavior. We use the same metrics to evaluate extraction performance at the field level, and then aggregate the results by KDE group. We also report a mismatch rate. This measures how often the system’s extracted value differs from the validated value when both contain a populated field.
| Classification | Precision | Recall | F1 |
|---|---|---|---|
| Source Scope (macro) | 0.70 | 0.69 | 0.64 |
| Incident | 0.50 | 1.00 | 0.67 |
| Related, no incident | 0.64 | 0.55 | 0.59 |
| Unrelated | 0.96 | 0.52 | 0.68 |
| IUU Type (micro) | 0.82 | 0.87 | 0.84 |
| IUU Type (macro) | 0.84 | 0.87 | 0.82 |
| IUU Sub-Behavior (micro) | 0.66 | 0.81 | 0.73 |
| IUU Sub-Behavior (macro) | 0.74 | 0.81 | 0.75 |
| KDE Group | Precision | Recall | F1 | Mismatch Rate |
|---|---|---|---|---|
| Species | 0.71 | 0.71 | 0.71 | 0.86 |
| Products | 0.67 | 0.56 | 0.61 | 0.88 |
| Compliance | 0.94 | 0.64 | 0.76 | 0.07 |
| Vessel | 0.86 | 0.48 | 0.62 | 0.17 |
| Aggregation | 0.90 | 0.36 | 0.51 | 0.20 |
| Landing | 1.00 | 0.36 | 0.53 | 0.00 |
| Distribution | 1.00 | 0.36 | 0.53 | 0.00 |
| Crew | 0.71 | 0.20 | 0.31 | 0.40 |
| Aquaculture | 0.83 | 0.20 | 0.32 | 0.33 |
| Event | 0.39 | 0.36 | 0.38 | 0.64 |
| Labor Standards | 0.67 | 0.32 | 0.43 | 0.67 |
| Catch | 0.55 | 0.24 | 0.33 | 0.71 |
| Transshipment | 0.83 | 0.20 | 0.32 | 0.50 |
| Trade | 0.67 | 0.24 | 0.35 | 0.60 |
The classification results in Table 3 show that IUU+DB performs reasonably well across the main classification tasks. However, the confusion matrix, seen in fig 6 and the document-scope results show some weaknesses. For example, incident classification has a precision of 0.50 and a recall of 1.00. This suggests that the classifier is tuned to avoid missing incidents, but it also marks too many documents as incidents. This low precision adds noise to the database and makes downstream analysis more difficult.
For sources classified as incidents, we evaluated extraction performance; the results are shown in Table 4. We observe four main patterns: high recall with high mismatch, moderate-to-high recall with low mismatch, low recall with low mismatch, and low-to-moderate recall and high mismatch. In the high-recall, high-mismatch groups, such as species and products, the system often finds the relevant information but also includes noisy or incorrect values. In the moderate-to-high-recall, low-mismatch groups, such as compliance and vessel information, the system performs well and extracts accurate information at a reasonable rate. In the low-recall, low-mismatch groups, such as aggregation, distribution, and landing, the system often misses relevant fields. However, when it does extract them, the values are usually correct. Finally the last group are the fields that the system does not often extract, and when it does it often struggles to do so with high degrees of accuracy, such as information regarding catch. or labor standards. These KDE groups highlight areas for improvement with more training examples for prompt optimization.
From these statistics we can see that while IUU+DB generally performs well, there is more work needed to reduce the amount of noise, by improving the precision of the scope classifier and reducing the mismatch rate.
4.5. Ablation Comparison against Baselines
To evaluate whether IUU+DB improves performance we compare the results of the test set above against two base lines: GPT-4o Mini (OpenAI et al., 2024), the model used in our implementation, as well as GPT 5.4 Mini (OpenAI et al., 2026) to test how IUU+DB performs w.r.t. newer models.
| Source | IUU Type | IUU Sub-Behavior | |||
|---|---|---|---|---|---|
| Model | F1 | macro F1 | micro F1 | macro F1 | micro F1 |
| GPT-4o Mini | 0.62 | 0.70 | 0.60 | 0.58 | 0.53 |
| GPT-5.4 Mini | 0.62 | 0.76 | 0.74 | 0.62 | 0.71 |
| IUU+DB | 0.65 | 0.84 | 0.82 | 0.73 | 0.75 |
As can be seen from Table 5, all three models perform well. IUU+DB offers a slight improvement on the initial task of classifying the scope against both stand alone LLM calls, but shows 15-20% improvement over the plain GPT-4o calls and 10% improvement over the newer GPT-5.4 model for IUU+ Type and Sub-Behavior respectively. Additionally, we noticed that both GPT-4o Mini and GPT-5.4 Mini extracted about 5 fields fewer than the human extraction or than IUU+DB did; see Fig. 7.
5. Ethical Considerations
Intellectual Property. This project aggregates IUU+ content from a range of sources, including some that are behind paywalls or have restrictions on rehosting under their Terms of Service. When paywalled articles were used, we accessed them through legitimate paid subscriptions or purchased access; the system does not scrape or redistribute protected content without authorization. To respect publisher rights and revenue models, IUU+DB does not show full article text to end users. Instead, users see a summary, source metadata, and a link to the original document. Full text is retained only on the backend for indexing, validation, and audit purposes, and access is limited to a small set of administrative users.
Biased Sources. The source documents that are aggregated reflect the perspectives of the organizations and states that produce them. There may be cases where the illegality of an action is disputed between states, e.g., fishing in disputed waters. This project does not aim to adjudicate these disputes nor claim that all cases within are facts, rather we aim to record claimed instances of IUU+ activity for analysis.
Territorial Ambiguity. Some incidents in this dataset occur within territories with contested recognition under international law, such as Somaliland. Assigning an incident to a country necessarily requires choices that may favor one political stance over another. For this project, we match locations to ISO-3166 country codes in order to standardize country names. For events that take place within areas that do not have a ISO code we register any location data available and leave the country field empty.
6. Discussion
6.1. Limitations
Source Equivalence and Factual Reliability
As previously mentioned, a wide variety of sources are ingested, with a range of credibility. We currently treat all sources as equivalently trustworthy, despite large differences in the evidentiary standards (e.g., a small news organization versus a peer reviewed paper). Users should interpret the incidents as claims of incidents rather than ground truthed fact. Additionally, our data are mostly from English language sources, and refer broadly to events in countries where news is reported in English.
Data Extraction Quality
IUU+DB is limited by the quality of the source material when determining KDEs like the location and species involved. For example, some sources do not specify any species details at all (Environment Agency of the United Kingdom, 2024). Others use generic species group names rather than an exact species distinguisher (e.g., names like ‘tuna’, ‘salmon’, ‘crab’, instead of the specific species names such as ‘yellowfin tuna’, ‘sockeye salmon’, ‘blue swimming crab’). Meanwhile, other sources include the full scientific name (e.g., ‘Thunnus albacares’ for yellowfin tuna).
Incident Bias
The presence of incident reporting should not be the only evaluation criterion used to determine a holistic perspective on the occurrence of IUU+ behaviors. The stories that are covered in the news and through online government agency press releases mean that IUU+ behaviors are being detected and enforcement is taking place. However, in geographies and jurisdictions where enforcement is lacking or non-existent, there will be few incidents to catalog. Nevertheless, there still may be IUU+ behaviors occurring but the lack of enforcement means they are poorly cataloged in our database.
6.2. Future Work
Future work will focus on improving extraction accuracy and strengthening source-scope and IUU+ type classification. Better classification will reduce noise in the database and make it easier for subject matter experts to identify useful patterns and insights. We will also use the generated labels, along with the manually labeled fine-tuning set, to develop and evaluate alternative classification modules that can improve IUU+DB’s performance.
We also plan to expand the system to ingest image and figure data. Some important information may appear only in graphics, tables, maps, or other visual formats, and adding this capability would allow IUU+DB to capture a broader range of evidence from each source.
Another area of future work is the development of a more user-friendly record validation pipeline that can be opened up to external users. As the database grows, expert validation will remain important for improving record quality, correcting errors, and helping the system learn from reviewed examples.
Finally, we plan to move IUU+DB toward a regularly updated database that can be refreshed on a weekly basis. This will include improved extraction pipelines for each source type and broader coverage of web-scraped sources, including additional NGO, government, industry, and media sources from around the world.
7. Conclusion
This paper offers IUU+DB as a flexible framework for extracting complex schema from variable length sources at scale. IUU+DB is a tool unique in its scope and potential for near-real time of coverage of IUU+ incidents, giving it the potential to be highly useful to regulators or stakeholder groups. Users can use IUU+DB to quickly compile available recent information on a country’s or fishery’s IUU+ incidents, apply risk-based screening to seafood trade controls, or better understand links between KDEs (e.g., certain fishing practices and labor abuses) and underlying IUU+ incidents. These varied uses for IUU+DB can help improve fishery conservation and social outcomes.
Our results also show that IUU+DB can surface patterns that are difficult to see from individual reports alone. These include the distribution of incidents across IUU+ behavior types, countries, species, and source categories, as well as co-occurrence patterns among IUU+ types and behaviors. Future analyses can use these structured records to identify emerging hotspots, compare reporting patterns across source types, and better understand how different forms of IUU+ activity are connected. This makes IUU+DB not only a database, but also a tool for generating evidence that can inform policy, enforcement priorities, and future research.
Appendix A Appendix: IUU+ Behaviors and KDE Data Dictionary
We define 7 IUU+ types each with a set of related behaviors. They are as follows: 1) Illegal Fishing, constituent behaviors include: exceeding catch quotas, keeping undersized fish, catching unauthorized, protected or prohibited species, the use of banned or prohibited fishing gear, fishing in closed areas or during closed seasons, obscuring vessel identity, falsifying documents, engaging in unauthorized transshipment, engaging in illegal bycatch; 2) Unreported Fishing, constituent behaviors include: unreported or underreported target catch weight or size, unreported or underreported discards, bycatch size or weight, misreported target catch species, misreported non target species, misreported gear, misreported fishing time or location; 3) Unregulated Fishing, constituent behaviors include: dishing without a flag state, fishing under flag state not party to RFMO, fishing in unregulated areas or for unregulated stock; 4) Seafood Fraud or Mislabeling, constituent behaviors include: species mislabeling or fraud, production information fraud; 5) Forced Labor or Labor Abuse, constituent behaviors include: wage and pay violations, abusive living conditions, abusing working conditions, inadequate crew size, physical or sexual violence, workplace intimidation, workers families threatened, isolation, migrants threatened; 6) Circumventing Prohibitions or Sanctions, constituent behaviors include: circumventing sanctions (individuals or corporations), circumventing import prohibitions (countries or products); 7) Aquaculture Specific Activities, constituent behaviors include: unapproved or non-native species, illegal sourcing of broodstock or seed, misrepresentation or unauthorized farm operations, unlicensed, unregistered, or unauthorized farm operations, and stolen aquaculture products.
We identified the following 14 groups of KDEs: Species, Product, Event, Vessel, Crew, Labor, Catch, Compliance, Aquaculture, Transshipment, Aggregation, Trade, Distribution, Landing. Taken together these groups have 100 KDE fields.
References
- AFP (2022) Fish trade’s murky waters cloud double murder in amazon. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.1, Table 1.
- D. J. Agnew, J. Pearce, G. Pramod, T. Peatman, R. Watson, J. R. Beddington, and T. J. Pitcher (2009) Estimating the Worldwide Extent of Illegal Fishing. PLOS ONE 4 (2), pp. e4570 (en). External Links: ISSN 1932-6203, Link, Document Cited by: §1.
- K. Auld, R. Baumler, D. P. Han, and F. Neat (2023) The collective effort of the United Nations Specialised Agencies to tackle the global problem of illegal, unreported and unregulated (IUU) fishing. Ocean & Coastal Management 243, pp. 106720. External Links: ISSN 0964-5691, Link, Document Cited by: §1.
- D. Belhabib (2026) Spyglass: Global Fishing Crimes Map. Ecotrust Canada. External Links: Link Cited by: §1.
- Benjie B. Talisic, Tito P. Tan (2022) 3 nabbed for illegal fishing in bohol. Note: Accessed: 13 May 2026 External Links: Link Cited by: Table 1.
- F. Blaha, A. Vincent, and Y. Piedrahita (2023) Guidance document: Advancing end-to-end traceability. Critical tracking events and key data elements along capture fisheries and aquaculture value chains. FAO, Rome, Italy (English). External Links: ISBN 978-92-5-137835-9, Link, Document Cited by: §1.
- A. Dathe, K. Hoffmann, and A. Mangold (2026) Useful for exploration, risky for precision: evaluating ai tools in academic research. External Links: 2605.10125, Link Cited by: §2.
- Environment Agency of the United Kingdom (2024) 4 licence dodgers receive fines of £710 for fishing illegally. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.1, §6.1.
- EU IUU Coalition (2025) Import control schemes in major seafood markets: a comparative study of key data elements in the European Union, the United States, Japan and the Republic of Korea. Technical report EU IUU Coalition. External Links: Link Cited by: §1.
- European Commission (2026) EU rules to combat IUU fishing. (en). External Links: Link Cited by: §1.
- Firstpost.com (2023) Sri lanka arrests 14 indian fishermen for ’poaching’ in its waters, in all 240 so far this year. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.1, Table 1.
- Fisheries and Oceans Canada (2023) Owners of canadian fishing vessel ocean provider fined and over 30,000 pounds of tuna seized. Note: Accessed: 13 May 2026 External Links: Link Cited by: Table 1.
- GDST (2026) Global Dialogue on Seafood Traceability. (en-GB). External Links: Link Cited by: §1.
- S. M. Glaser, P. M. Roberts, and K. J. Hurlburt (2019) Foreign Illegal, Unreported, and Unregulated Fishing in Somali Waters Perpetuates Conflict. Frontiers in Marine Science 6 (English). External Links: ISSN 2296-7745, Link, Document Cited by: §2.
- R. Y. Hodgson, S. A. Robinson, A. C. Boutin, F. K. Chan, J. R. Bennett, R. T. Buxton, J. H. Caufield, D. E. L. Hanna, and T. Alamenciak (2026) Assessing the effectiveness of ontology-grounded ai term extraction using ontogpt for environmental evidence synthesis. Environmental Evidence. Cited by: §2.
- G. E. Hosch and S. C. Clarke (2025) Advancing the Crucial Notion of “Interoperability” in Catch Documentation Schemes. Fisheries Management and Ecology (en). External Links: Link, Document Cited by: §1.
- Ian Urbina (2024) Two men charged with getting early illegal jump on floridas spiny lobster season. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.3.
- Island.lk (2023) Navy detain two indian trawlers 25 fisherman poaching in sri lankan waters. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.1.
- Keith Doucette (2023) Officers seize $500,000 worth of baby eels outside halifax amid fishery closure. Note: Accessed: 13 May 2026 External Links: Link Cited by: Table 1.
- P. Knoth, D. Herrmannova, M. Cancellieri, L. Anastasiou, N. Pontika, S. Pearce, B. Gyawali, and D. Pride (2023) CORE: a global aggregation service for open access papers. Nature Scientific Data 10 (1), pp. 366 (en). Cited by: §3.7.
- S. Li, A. Sadekar, N. Self, Y. Su, L. Andersland, M. Chaplin, A. Zhang, H. Yang, J. B. Henderson, K. Wigginton, et al. (2025) Exploring llms for scientific information extraction using the sciex framework. arXiv preprint arXiv:2512.10004. External Links: Document Cited by: §2.
- G. M. Luque and C. J. Donlan (2019) The characterization of seafood mislabeling: A global meta-analysis. Biological Conservation 236, pp. 556–570. External Links: ISSN 0006-3207, Link, Document Cited by: §2.
- B. M. Marshall, C. T. Strine, M. L. Gore, E. A. Eskew, O. C. Stringham, P. Cardoso, S. Chekunov, F. Watters, C. Fukushima, P. García-Díaz, J. S. Sinclair, M. F. Tlusty, R. J. Almeida, J. W. Valdez, and A. C. Hughes (2025) Mapping the global dimensions of US wildlife imports. Current Biology 35 (16), pp. 3959–3972.e4. External Links: ISSN 0960-9822, Link, Document Cited by: §1.
- C. May-Mavrellis (2017) Transnational Crime and the Developing World. Technical report Global Financial Integrity (en). External Links: Link Cited by: §1.
- MongaBay (2026) Note: Accessed: 2 November 2025 External Links: Link Cited by: §3.7.
- NewsAPI.ai (2026) Real-time and historical news data. Note: Accessed: 2 November 2025 External Links: Link Cited by: §3.7.
- NOAA Fisheries (2024) Action Plan to Improve the U.S. Seafood Import Monitoring Program. Technical report NOAA. External Links: Link Cited by: §1.
- NOAA Fisheries (2025) Seafood Import Monitoring Program (SIMP). NOAA Fisheries (en). Note: Archive Location: International External Links: Link Cited by: §1.
- NOAA Fisheries (2026) United states national oceanic and atmostsperic administration national marine fisheries service. Note: Accessed: 2 November 2025 External Links: Link Cited by: §3.7.
- Oceana (2026) Note: Accessed: 2 November 2025 External Links: Link Cited by: §3.7.
- OpenAI, A. Hurst, A. Lerer, A. P. Goucher, A. Perelman, and A. R. et al. (2024) GPT-4o system card. External Links: 2410.21276, Link Cited by: §4.5.
- OpenAI, A. Singh, A. Fry, A. Perelman, A. Tart, A. Ganesh, A. El-Kishky, A. McLaughlin, and A. L. et al (2026) OpenAI gpt-5 system card. External Links: 2601.03267, Link Cited by: §4.5.
- K. Opsahl-Ong, M. J. Ryan, J. Purtell, D. Broman, C. Potts, M. Zaharia, and O. Khattab (2024) Optimizing instructions and demonstrations for multi-stage language model programs. External Links: 2406.11695, Link Cited by: §3.6.
- Patrick Gouldsbrough (2022) County durham man fined for killing fish with spear in river wear. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.1, Table 1.
- D. Salas, A. Sheik Heile, M. A. Schnurr, M. de Jong, M. Duff, and K. Swanson (2026) Seas of unfreedom: A scoping review of labour exploitation as a structural feature of global fisheries. Marine Policy 191, pp. 107147. External Links: ISSN 0308-597X, Link, Document Cited by: §2.
- SerpAPI.com (2025) Google scholar api. Note: Accessed: 2 November 2025 External Links: Link Cited by: §3.7.
- A. Shaver and S. Yozell (2018) Casting a Wider Net: The Security Implications of Illegal, Unreported, and Unregulated Fishing. Technical report The Stimson Center (en-US). External Links: Link Cited by: §1.
- J. C. Simeone (2026) U.S. Imports of Fish and Seafood: An Evaluation of Coverage under the Seafood Import Monitoring Program. Technical report U.S. IUU Fishing & Labor Rights Coalition (en-US). External Links: Link Cited by: §1.
- S. Spillias, K. M. Ollerhead, M. Andreotta, R. Annand-Jones, F. Boschetti, J. Duggan, D. B. Karcher, C. Paris, R. J. Shellock, and R. Trebilco (2025) Evaluating generative ai for qualitative data extraction in community-based fisheries management literature. Environmental Evidence. Cited by: §2.
- O. C. Stringham, S. Moncayo, E. Thomas, S. Heinrich, A. Toomes, J. Maher, K. G. W. Hill, L. Mitchell, J. V. Ross, C. R. Shepherd, and P. Cassey (2021) Dataset of seized wildlife and their intended uses. Data in Brief 39, pp. 107531. External Links: ISSN 2352-3409, Link, Document Cited by: §1.
- A. J. Temple, D. J. Skerritt, P. E. C. Howarth, J. Pearce, and S. C. Mangi (2022) Illegal, unregulated and unreported fishing impacts: A systematic review of evidence and proposed future agenda. Marine Policy 139, pp. 105033. External Links: ISSN 0308-597X, Link, Document Cited by: §2.
- TRAFFIC (2026) Wildlife Trade Portal. External Links: Link Cited by: §1.
- UN FAO (2023) Quantifying IUU fishing. Implementation of the International Plan of Action to Deter, Prevent and Eliminate Illegal, Unreported and Unregulated Fishing., UN FAO (en). External Links: ISBN 978-92-5-137937-0, Link, Document, Document Cited by: §2.
- UN FAO (2026a) Agreement on Port State Measures (PSMA). Food and Agriculture Organization of the United Nations. External Links: Link Cited by: §1.
- UN FAO (2026b) Monitoring, control and surveillance systems to combat illegal, unreported and unregulated fishing. (en). External Links: Link Cited by: §1.
- Undercurrent News (2022) Two men charged with getting early illegal jump on floridas spiny lobster season. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.3.
- Undercurrent News (2024) Mexican drug cartel linked to illegal red snapper harvests in us waters. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.1, Table 1.
- Undercurrent News (2025) Canadian man gets cad 1m fine, six years in jail for illegal fishing. Note: Accessed: 13 May 2026 External Links: Link Cited by: §4.1.
- Undercurrent News (2026a) Note: Accessed: 2 November 2025 External Links: Link Cited by: §3.7.
- Undercurrent News (2026b) North star faces $53,000 fine over alaska violations. Note: Accessed: 13 May 2026 External Links: Link Cited by: Table 1.
- US DOJ (2026) United states department of justice. Note: Accessed: 2 November 2025 External Links: Link Cited by: §3.7.
- B. J. Weissgold (2024) US wildlife trade data lack quality control necessary for accurate scientific interpretation and policy application. Conservation Letters 17 (2), pp. e13005 (en). Note: _eprint: https://conbio.onlinelibrary.wiley.com/doi/pdf/10.1111/conl.13005 External Links: ISSN 1755-263X, Link, Document Cited by: §1.
- WWF (2014) Illegal Russian Crab: An Investigation of Trade Flow. Technical report World Wildlife Fund. External Links: Link Cited by: §2.
- Z. Zhang, W. You, T. Wu, X. Wang, J. Li, and M. Zhang (2025) A survey of generative information extraction. In Proceedings of the 31st International Conference on Computational Linguistics, O. Rambow, L. Wanner, M. Apidianaki, H. Al-Khalifa, B. D. Eugenio, and S. Schockaert (Eds.), Abu Dhabi, UAE, pp. 4840–4870. External Links: Link Cited by: §2.
