AI RESEARCH
UK GDPR Small Business Q&A — 5,000 synthetic pairs with article-level citations [D]
r/MachineLearning
•
Dataset for fine-tuning compliance assistants. Each pair includes: - A practical SME-facing question ("Can I use pre-ticked consent boxes?") - An answer with specific UK GDPR article references, ICO guidance by name, and actionable steps - Source metadata: which GDPR concepts were used, which generation strategy, timestamp Generation method: questions via local Qwen 14B from a curated term bank, answers via DeepSeek API for factual reliability. JSON + Parquet, MIT license for the 1K sample.