Skip to content

Releases: jacksonllee/pylangacq

v0.19.1

29 Mar 11:23
Compare
Choose a tag to compare

Fixed

  • Handled the duration mark (e.g., [# 0.4]) in utterance cleaning.

v0.19.0

13 Dec 05:28
Compare
Choose a tag to compare

Added

  • Added support for Python 3.12.
  • Handled pre-clitics and post-clitics from %mor tiers
    and honored their distinction in the parsed utterance.

v0.18.0

12 Mar 01:07
Compare
Choose a tag to compare

Added

  • Added support for Python 3.11.

Changed

  • Updated the test data from Brown's Eve from the upstream CHILDES.

Removed

  • Dropped support for Python 3.7.

v0.17.0

10 Jun 02:45
Compare
Choose a tag to compare

[0.17.0] - 2022-06-09

Added

  • Added the exclude_switch option for MLU (mlu(), mlum(), and mluw()),
    so that words with @s for switching language may be excluded.

Fixed

  • Fixed MLU computation (mlu(), mlum(), and mluw()):
    • If xxx, yyy, or www appears in an utterance, the whole utterance is ignored.
    • If there are no MLU-relevant words/morphemes in an utterances, the whole utterance is ignored.

v0.16.2

17 Mar 05:53
Compare
Choose a tag to compare

[0.16.2] - 2022-03-17

Fixed

  • Moved download_and_extract_brown test function to under the pylangacq package
    namespace, as tests from BaseTestCHATReader require downloaded CHAT data files.

v0.16.1

17 Mar 05:15
Compare
Choose a tag to compare

[0.16.1] - 2022-03-17

Changed

  • Restructured the repository to use top-level src/ and tests/ directories.

Removed

  • Removed setup.py.

Fixed

  • Moved BaseTestCHATReader back under the pylangacq package namespace
    so that downstream packages can import BaseTestCHATReader for testing.

v0.16.0

28 Dec 04:56
Compare
Choose a tag to compare

[0.16.0] - 2021-12-27

Added

  • Reader objects can now be concatenated by the addition operator +.
  • Implemented the head, tail, and info methods at Reader.
  • Added support for Python 3.10.
  • Turned on Windows testing on CircleCI.
  • Added pyproject.toml. Related to prioritizing setup.cfg for specifying
    build metadata and options.

Changed

  • The to_strs and to_chat methods of a Reader object return
    tabulated outputs by default.
  • Prioritized to_chat for the single file output use case.
  • Unzipping CHAT data now uses less memory.
  • Switched to setup.cfg to fully specify build metadata and options,
    while keeping a minimal setup.py for backward compatibility.
    Related to the new pyproject.toml.
  • Switched the Sphinx docs theme from sphinx-rtd-theme to furo.

Removed

  • Dropped support for Python 3.6.

Security

  • Turned on safety and bandit checks at CircleCI builds.

v0.15.0

06 Jun 15:10
Compare
Choose a tag to compare

[0.15.0] - 2021-06-06

Added

  • Reader.from_zip (also read_chat) now keeps the downloaded ZIP archive
    in a non-temporary directory for possible re-use.
    • Added the kwarg use_cached in Reader.from_zip, so that we use the cached data
      by default for the same input URL, and that we can force re-downloading by
      setting use_cached to False.
    • Added the kwarg session in Reader.from_zip, in case using a customized
      requests.Session instance is desired. session also makes it possible to
      write tests for the new kwarg use_cached.
    • Added the helper functions cached_data_info and remove_cached_data.
  • Reader has the new to_strs method that yields CHAT data strings.
  • Reader has the new to_chat method that exports data to local files.

Changed

  • CHAT parsing for the header information is now more robust for varying whitespace
    characters between the head and its associated value.

Removed

  • Dropped kwarg allow_remote in Reader.from_zip. This kwarg wouldn't make any sense
    anymore, or at least would be confusing with the introduction of use_cached.

v0.14.1

17 May 00:56
Compare
Choose a tag to compare

[0.14.1] - 2021-05-16

Fixed

  • The header/metadata has a more reasonable representation for emptiness
    when input data is empty.

v0.14.0

12 May 20:26
Compare
Choose a tag to compare

[0.14.0] - 2021-05-12

Added

  • Added the parallel optional argument to the Reader methods
    {from_zip, from_dir, from_files, from_strs}
    so that parallelization can be turned off if desired.
  • Added the filter method to Reader for filtering data by file paths.