VOOZH about

URL: https://huggingface.co/datasets/gplsi/alia_intellectual_property

⇱ gplsi/alia_intellectual_property · Datasets at Hugging Face


Dataset Preview
Duplicate
format
string
source
string
language
string
content
string
metadata
dict
md
EURLEX
en
| | | | | | --- | --- | --- | --- | | 16.5.2012 | EN | Official Journal of the European Union | L 129/1 | --- REGULATION (EU) No 386/2012 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 19 April 2012 on entrusting the Office for Harmonization in the Internal Market (Trade Marks and Designs) with tasks related ...
{ "source": "https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32012R0386&qid=1761046312900&rid=1", "title": "Regulation (EU) No 386/2012 of the European Parliament and of the Council of 19 April 2012 on entrusting the Office for Harmonization in the Internal Market (Trade Marks and Designs) with tasks relate...
md
EURLEX
en
| | | | | | --- | --- | --- | --- | | 30.4.2004 | EN | Official Journal of the European Communities | L 157/45 | --- DIRECTIVE 2004/48/EC OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 29 April 2004 on the enforcement of intellectual property rights (Text with EEA relevance) THE EUROPEAN PARLIAMENT AND THE ...
{ "source": "https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32004L0048&qid=1761046312900&rid=2", "title": "DIRECTIVE 2004/48/EC OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 29 April 2004 on the enforcement of intellectual property rights (Text with EEA relevance)", "language": "en", "celex": "32004...
md
EURLEX
en
| | | | | | --- | --- | --- | --- | | 29.6.2013 | EN | Official Journal of the European Union | L 181/15 | --- REGULATION (EU) No 608/2013 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 12 June 2013 concerning customs enforcement of intellectual property rights and repealing Council Regulation (EC) No 1383/20...
{ "source": "https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32013R0608&qid=1761046312900&rid=3", "title": "Regulation (EU) No 608/2013 of the European Parliament and of the Council of 12 June 2013 concerning customs enforcement of intellectual property rights and repealing Council Regulation (EC) No 1383/2...
md
EURLEX
en
| | | | | | --- | --- | --- | --- | | 27.12.2006 | EN | Official Journal of the European Union | L 376/28 | --- DIRECTIVE 2006/115/EC OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 12 December 2006 on rental right and lending right and on certain rights related to copyright in the field of intellectual proper...
{ "source": "https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32006L0115&qid=1761046312900&rid=4", "title": "Directive 2006/115/EC of the European Parliament and of the Council of 12 December 2006 on rental right and lending right and on certain rights related to copyright in the field of intellectual proper...
md
EURLEX
en
"[**Avis juridique important**](../../../editorial/legal_notice.htm)\n\n*|*\n\n# 32001L0029\n\n**Dir(...TRUNCATED)
{"source":"https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32001L0029&qid=1761046312900&rid=(...TRUNCATED)
md
EURLEX
en
"| | | | |\n| --- | --- | --- | --- |\n| 23.12.2015 | EN | Official Journal of the European Unio(...TRUNCATED)
{"source":"https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32015L2436&qid=1761046312900&rid=(...TRUNCATED)
md
EURLEX
en
"| | | | |\n| --- | --- | --- | --- |\n| 16.6.2017 | EN | Official Journal of the European Union(...TRUNCATED)
{"source":"https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32017R1001&qid=1761046312900&rid=(...TRUNCATED)
md
EURLEX
en
"| | | | |\n| --- | --- | --- | --- |\n| 20.9.2017 | EN | Official Journal of the European Union(...TRUNCATED)
{"source":"https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32017L1564&qid=1761046312900&rid=(...TRUNCATED)
md
EURLEX
en
"| | | | |\n| --- | --- | --- | --- |\n| 27.12.2006 | EN | Official Journal of the European Unio(...TRUNCATED)
{"source":"https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32006L0116&qid=1761046312900&rid=(...TRUNCATED)
md
EURLEX
en
"| | | |\n| --- | --- | --- |\n| European flag | Official Journal of the European Union | EN Se(...TRUNCATED)
{"source":"https://eur-lex.europa.eu/legal-content/AUTO/?uri=CELEX:32023R2411&qid=1761046312900&rid=(...TRUNCATED)
End of preview.

📘 ALIA_INTELLECTUAL_PROPERTY Dataset

The ALIA_INTELLECTUAL_PROPERTY dataset is a multilingual resource designed for text generation tasks within the intellectual property (IP) domain, including topics such as copyrights, patents, trademarks, and related legal and institutional information.

The dataset consists of textual documents formatted in Markdown (.md), each provided as structured JSONL entries. Each entry includes information about the text's source, language, format, text, and metadata.

🧾 Column Descriptions

Field Type Description
format string Indicates the text format. All entries use "md" (Markdown).
source string Source of the document (institution, website, or project).
language string Language of the content: "es" (Spanish).
text string The main textual content in Markdown format.
metadata object Supplementary information.

🌍 Dataset Composition

  • Languages: Spanish (es)
  • Domain: Intellectual Property (copyright, patents, trademarks, and related legal frameworks)
  • Format: JSON Lines (.jsonl)

Each item represents a standalone intellectual-property-related text.

🔎 Sources

  • eurlex-es-md.jsonl: Filtered content from EUR-Lex (Spanish) using the keyword "intellectual property".

⚠️ Notes

  • The dataset is automatically curated from intellectual property–related sources.
  • Metadata coverage may vary by entry.
  • Content may include Markdown formatting for structure (e.g., headers, lists, emphasis).

💰 Funding

This work is funded by the Ministerio para la Transformación Digital y de la Función Pública, co-financed by the EU – NextGenerationEU, within the framework of the project Desarrollo de Modelos ALIA.

📚 Reference

Please cite this dataset using the following BibTeX format:


@misc{alia2025intellectualproperty,
author = {Espinosa Zaragoza, Sergio and Maestre, Mar{'\i}a Mir{'o} and Mu{~n}oz Guillena, Rafael and Consuegra-Ayala, Juan Pablo},
title = {ALIA_INTELLECTUAL_PROPERTY Dataset},
year = {2025},
institution = {Language and Information Systems Group (GPLSI) and Centro de Inteligencia Digital (CENID), University of Alicante (UA)},
howpublished = {\url{[https://huggingface.co/datasets/gplsi/alia_intellectual_property}}](https://huggingface.co/datasets/gplsi/alia_intellectual_property}})
}

⚠️ Disclaimer

Be aware that the data may contain biases or other unintended distortions. When third parties deploy systems or provide services based on this data, or use the data themselves, they bear the responsibility for mitigating any associated risks and ensuring compliance with applicable regulations, including those governing the use of Artificial Intelligence.

The University of Alicante, as the owner and creator of the dataset, shall not be held liable for any outcomes resulting from third-party use.

📜 License

This work is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) licence.

Downloads last month
31

Models trained or fine-tuned on gplsi/alia_intellectual_property