A framework for prompt tuning using Intent-based Prompt Calibration
-
Updated
Jun 23, 2024 - Python
A framework for prompt tuning using Intent-based Prompt Calibration
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Perception toolkit for sim2real training and validation in Unity
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
A curated list of awesome projects which use Machine Learning to generate synthetic content.
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
NVIDIA Deep learning Dataset Synthesizer (NDDS)
SynthDet - An end-to-end object detection pipeline using synthetic data
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Unity's privacy-preserving human-centric synthetic data generator
Random dataframe and database table generator
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
awesome synthetic (text) datasets
[CVPR 2021] DeFMO: Deblurring and Shape Recovery of Fast Moving Objects
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing"
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
NVIDIA Dataset Utilities (NVDU)
Add a description, image, and links to the synthetic-dataset-generation topic page so that developers can more easily learn about it.
To associate your repository with the synthetic-dataset-generation topic, visit your repo's landing page and select "manage topics."