Dataset License Agreement

Based on Open Data Commons Attribution-NonCommercial (ODC-BY-NC) with Clarifications

Quick Reference Guide

Permitted

  • University research → open access publication
  • Non-profit public security study → free report
  • Government agency analysis → public dataset
  • Classroom teaching and free online courses
  • Academic benchmarks (freely published)

Prohibited

  • Startup using data for product development
  • Publishing in a paywalled journal (e.g., Nature) without permission
  • Sponsored research with IP restrictions
  • Training commercial AI models
  • Commercial consulting services

Contact First

  • Industry partnership with open publication
  • Non-profit with revenue-generating activities
  • Hybrid academic-industry projects

Questions or need a commercial license?

hello@sting9.org

License Terms

1. Definitions

"Dataset"

means the collection of data made available under this license.

"Commercial Use"

means any use primarily intended for or directed toward commercial advantage or monetary compensation, including but not limited to:

  • Use by for-profit entities in their business operations
  • Use in developing commercial products or services
  • Use in industry-sponsored research where the sponsor retains commercial rights to results
  • Publication of results or derivatives behind paywalls or subscription services
  • Use to train AI/ML models for commercial products
  • Consulting services using this Dataset

"Non-Commercial Use"

means use that meets ALL of the following criteria:

  • Primary purpose is academic research, education, or public benefit
  • Results are made freely and publicly available without access restrictions
  • No monetary compensation is derived from the use
  • If the use is sponsored, the sponsor is not a for-profit entity, unless it has waived all commercial rights
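
Because the definition requires ALL criteria to hold at once, it can be read as a logical conjunction. The following is a minimal sketch of that reading, not a substitute for the license text; the `UseCase` class, its field names, and `is_non_commercial` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class UseCase:
    """Hypothetical checklist; field names are illustrative, not license terms."""
    primary_purpose_public: bool      # academic research, education, or public benefit
    results_freely_available: bool    # published without access restrictions
    earns_money: bool                 # any monetary compensation derived from the use
    sponsored: bool = False
    sponsor_for_profit: bool = False
    sponsor_waived_rights: bool = False  # sponsor waived all commercial rights


def is_non_commercial(use: UseCase) -> bool:
    """Return True only when every criterion of the definition holds."""
    # The sponsor criterion applies only to sponsored work, and a for-profit
    # sponsor is acceptable only if it has waived all commercial rights.
    sponsor_ok = (
        not use.sponsored
        or not use.sponsor_for_profit
        or use.sponsor_waived_rights
    )
    return (
        use.primary_purpose_public
        and use.results_freely_available
        and not use.earns_money
        and sponsor_ok
    )


university_study = UseCase(True, True, False)
sponsored_study = UseCase(True, True, False, sponsored=True, sponsor_for_profit=True)
print(is_non_commercial(university_study))  # True
print(is_non_commercial(sponsored_study))   # False
```

Failing any single criterion makes the result False, which matches the "ALL of the following" wording. Borderline cases, such as those listed under "Contact First", are not captured by a checklist like this.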

"Permitted Entities"

include:

  • Accredited educational institutions (for research that qualifies as Non-Commercial Use)
  • Registered non-profit organizations
  • Government agencies and public institutions
  • Independent researchers publishing in open access venues

2. Grant of Rights

You are free to:

  • Share: Copy and redistribute the Dataset
  • Adapt: Modify, transform, and build upon the Dataset
  • Use: Use the Dataset for Non-Commercial purposes

3. Conditions

Attribution

You must give appropriate credit, provide a link to this license, and indicate if changes were made. Include:

  • Dataset name and version
  • Original source/creator
  • Link to original Dataset
  • Statement: "Used under ODC-BY-NC with clarifications"
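
As an illustration of the elements listed above, an attribution notice could be assembled as in the sketch below. The function `attribution_notice`, the layout of the notice, the version string "1.0", and the license link placeholder are all assumptions made for this example; the license does not prescribe a particular format.

```python
def attribution_notice(name, version, creator, dataset_url, license_url, changes=None):
    """Build a plain-text attribution notice (hypothetical layout)."""
    lines = [
        f"{name}, version {version}",       # dataset name and version
        f"Source: {creator}",               # original source/creator
        f"Dataset: {dataset_url}",          # link to original Dataset
        f"License: {license_url}",          # link to this license
        "Used under ODC-BY-NC with clarifications",
        # Indicate whether the Dataset was modified.
        f"Changes: {changes}" if changes else "Changes: none",
    ]
    return "\n".join(lines)


notice = attribution_notice(
    name="Sting9 Anti-Phishing Dataset",
    version="1.0",                              # assumed; use the version you obtained
    creator="Sting9 Research Initiative",
    dataset_url="https://sting9.org/",
    license_url="<link to this license>",       # placeholder, fill in the actual link
    changes="removed duplicate records",        # example value only
)
print(notice)
```

Passing `changes=None` produces a "Changes: none" line, so the notice always states whether the Dataset was modified.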

Non-Commercial

You may not use the Dataset for Commercial Use as defined above.

Open Results

Any research, publications, or derivatives must be:

  • Published in open access venues (no paywalls)
  • Made freely available to the public
  • Released under compatible open licenses

4. Specific Restrictions

Prohibited Uses:

  • Publishing research results in journals requiring payment or subscription for access
  • Industry-sponsored research where the sponsor retains exclusive rights, patents, or commercial control over findings
  • Training proprietary AI/ML models for commercial deployment
  • Use by commercial entities, even for "research purposes," unless approved in writing
  • Sublicensing with commercial terms

Paywalled Publications:

You may NOT publish results derived from this Dataset in venues that require payment, subscription, or institutional access.

Permitted:

  • Preprint servers (arXiv, bioRxiv, etc.)
  • Open access journals with author fees (APCs) if final publication is freely accessible
  • Conference proceedings only if freely available online

Industry-Sponsored Research:

Permitted ONLY if:

  • All results remain in the public domain
  • Sponsor waives all commercial rights in writing
  • Sponsor cannot restrict publication or use of findings
  • Results are published in open access venues

5. Commercial Licensing

For Commercial Use, contact hello@sting9.org to obtain a separate commercial license.

Commercial licenses are available for:

  • For-profit research and development
  • Commercial product development
  • Industry-sponsored research with commercial rights
  • Proprietary AI/ML model training

6. Disclaimer and Limitation of Liability

THE DATASET IS PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND. IN NO EVENT SHALL THE LICENSOR BE LIABLE FOR ANY CLAIM, DAMAGES, OR LIABILITY ARISING FROM USE OF THE DATASET.

7. Recommended Citation

Sting9 Research Initiative. (2025).
Sting9 Anti-Phishing Dataset.
Licensed under ODC-BY-NC with clarifications.
Available from: https://sting9.org/

Questions About Licensing?

We're happy to clarify any questions or discuss commercial licensing options.
