
Releases: databrickslabs/dbldatagen

release 0.4.0

07 Jun 21:44
aae8bde

This release includes the following changes:

  • various bug fixes
  • support for Constraints
  • support for standard datasets
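As a rough illustration of the constraints support listed above, the sketch below attaches a SQL constraint to a data specification. It assumes an active SparkSession is passed in as `spark` and that `withSqlConstraint` is available on `DataGenerator` in this release; the function name, column names, and value ranges are hypothetical.

```python
def build_constrained_df(spark, rows=1000):
    """Build a small synthetic DataFrame restricted by a SQL constraint."""
    # Imported inside the function so the snippet can be loaded without Spark installed.
    import dbldatagen as dg

    spec = (
        dg.DataGenerator(spark, name="constrained_example", rows=rows, partitions=4)
        .withColumn("order_qty", "int", minValue=1, maxValue=100, random=True)
        .withColumn("shipped_qty", "int", minValue=1, maxValue=100, random=True)
        # Assumption: rows that violate this expression are excluded from the output.
        .withSqlConstraint("shipped_qty <= order_qty")
    )
    return spec.build()
```

If the constraint is applied as a filter, the result may contain fewer rows than requested, so it is worth checking `df.count()` before relying on an exact size.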

The new standard datasets feature creates synthetic data sets in just a couple of lines of code, for benchmarking, optimization, and other purposes.
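A minimal sketch of what "a couple of lines" can look like, assuming an active SparkSession named `spark` and that the `basic/user` dataset is among those shipped with the installed version. The wrapper function is hypothetical and only keeps the snippet importable without Spark.

```python
def build_basic_user_df(spark, rows=1000):
    """Return a DataFrame generated from the 'basic/user' standard dataset."""
    # Imported inside the function so the snippet can be loaded without Spark installed.
    import dbldatagen as dg

    # Datasets(...).get(...) is expected to return a data generation spec; build() produces the DataFrame.
    return dg.Datasets(spark, "basic/user").get(rows=rows).build()
```

In a notebook this would be called as `df = build_basic_user_df(spark, rows=100000)`; the row count here is arbitrary.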

release/v.0.3.6.1: Feature hotfixes (#274)

22 May 19:34
2482dca

Hot fixes post v0.3.6

  • updates to documentation
  • updates to enable dbldatagen to work better with Databricks Connect
  • bumped version

Release v0.3.6

24 Feb 01:59

This release includes fixes for using dbldatagen on Databricks shared clusters.

release/v0.3.5

13 Jul 17:21
1c8b340

This release adds the following enhancements:

  • doc updates and additional content
  • additional support for creating struct and JSON valued columns
  • modifications to data spec code generation from existing datasets
  • enhancements to reduce scenarios where manual use of base columns is needed
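To make the struct and JSON item above more concrete, here is a hedged sketch that groups two generated columns into a struct column and into a JSON string column. It assumes `withStructColumn` accepts a `fields` list and an `asJson` flag as used here; the function name, column names, and values are hypothetical.

```python
def build_event_df(spark, rows=1000):
    """Build a DataFrame with a struct column and a JSON-valued column."""
    # Imported inside the function so the snippet can be loaded without Spark installed.
    import dbldatagen as dg

    spec = (
        dg.DataGenerator(spark, name="events_example", rows=rows, partitions=4)
        .withColumn("event_type", "string",
                    values=["activation", "deactivation", "plan change"], random=True)
        .withColumn("event_ts", "timestamp",
                    begin="2020-01-01 01:00:00", end="2020-12-31 23:59:00",
                    interval="1 minute", random=True)
        # Struct-valued column composed of the two columns above.
        .withStructColumn("event_info", fields=["event_type", "event_ts"])
        # Same fields, emitted as a JSON string (assumed behaviour of asJson=True).
        .withStructColumn("event_info_json", fields=["event_type", "event_ts"], asJson=True)
    )
    return spec.build()
```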

Hotfix v0.3.4post2

18 Apr 22:59

Fix for use of values in complex typed columns (array, struct, or map)

Hotfix v0.3.4post1

13 Apr 17:02

Hotfix to address logger modifying root logger behavior
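For context, the usual way for a library to avoid altering root logger behavior is to log through its own named logger and leave configuration to the application. The sketch below shows that general pattern using only the standard library; it is not dbldatagen's actual code, and the logger name `examplelib` is hypothetical.

```python
import logging

# Hypothetical library logger name, used only for this illustration.
LIB_LOGGER_NAME = "examplelib"


def get_library_logger():
    """Return the library's own logger without touching the root logger."""
    logger = logging.getLogger(LIB_LOGGER_NAME)
    # A NullHandler keeps the library quiet unless the application configures logging.
    if not any(isinstance(h, logging.NullHandler) for h in logger.handlers):
        logger.addHandler(logging.NullHandler())
    return logger


# Snapshot the root logger's state before the library logger is created.
root = logging.getLogger()
handlers_before = list(root.handlers)
level_before = root.level

lib_logger = get_library_logger()
lib_logger.info("library initialised")

# The root logger's handlers and level are the same as before the library call.
root_unchanged = (root.handlers == handlers_before and root.level == level_before)
```

Calling `logging.basicConfig(...)` or `logging.getLogger().setLevel(...)` from library code is what typically causes the kind of side effect this hotfix describes.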

release/v0.3.4

09 Apr 00:43

The new release, labelled v0.3.4, adds improvements to build ordering and some bug fixes, along with a new feature that automatically generates the code for a synthetic data set from an existing schema or dataset (once it has been loaded as a Spark dataframe).

The new code generation feature is marked as experimental, meaning that it is subject to change in coming releases.

The generated code is intended to be stub code that users will modify or refine to generate synthetic data that more closely matches their data set.

Feel free to file issues, or add discussion items with suggestions or feedback on improving this feature as it evolves.
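A hedged sketch of how the code generation feature might be invoked, assuming the `DataAnalyzer` class exposes `scriptDataGeneratorFromData` and `scriptDataGeneratorFromSchema` as used here. Because the feature is experimental, these names may differ between releases; the wrapper functions are hypothetical.

```python
def generate_spec_code_from_data(spark, source_df):
    """Return stub data-generation code derived from an existing DataFrame."""
    # Imported inside the function so the snippet can be loaded without Spark installed.
    import dbldatagen as dg

    analyzer = dg.DataAnalyzer(sparkSession=spark, df=source_df)
    # Expected to return the generated code as a string for the user to refine.
    return analyzer.scriptDataGeneratorFromData()


def generate_spec_code_from_schema(schema):
    """Return stub data-generation code derived from a Spark schema only."""
    import dbldatagen as dg

    return dg.DataAnalyzer.scriptDataGeneratorFromSchema(schema)
```

The schema-only variant has no access to the data values, so its output should need more manual refinement than the data-driven variant.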

release/v0.3.3post2

23 Mar 02:53

Corrected release versioning, as PyPI will still serve the latest release

  • includes corrections for logger initialization
  • includes hotfix for template expansion
  • includes use of pandas optimizations when processing column definitions with multiple output columns

Release v0.3.2 postfix 2

22 Mar 20:30

Hotfix for template expansion for v0.3.2

release/v0.3.2

11 Mar 05:31
93caee4

Release V0.3.2

  • enhancements to build ordering to reduce the need to explicitly call out base columns when referring to columns in expr SQL expressions
  • fixes for use with older Databricks LTS runtimes
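To illustrate the build ordering change described above, the sketch below defines a column with an `expr` that refers to two other columns without naming them through `baseColumn`. It assumes an active SparkSession passed in as `spark`; the function name, column names, and ranges are hypothetical.

```python
def build_derived_df(spark, rows=1000):
    """Build a DataFrame where a derived column references other columns via expr."""
    # Imported inside the function so the snippet can be loaded without Spark installed.
    import dbldatagen as dg

    spec = (
        dg.DataGenerator(spark, name="derived_example", rows=rows, partitions=4)
        .withColumn("quantity", "int", minValue=1, maxValue=10, random=True)
        .withColumn("unit_price", "int", minValue=5, maxValue=50, random=True)
        # No baseColumn argument: the columns referenced in the expression are
        # expected to be detected and built first.
        .withColumn("total", "int", expr="quantity * unit_price")
    )
    return spec.build()
```

On earlier versions, passing `baseColumn=["quantity", "unit_price"]` on the derived column would be the explicit way to express the same dependency.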