Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance improvement #1

Open
aaron-gu opened this issue Dec 19, 2019 · 2 comments
Open

Performance improvement #1

aaron-gu opened this issue Dec 19, 2019 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@aaron-gu
Copy link
Contributor

Python Pandas is slow for the large number of single-row operations in bedshift. It may be faster to read bedfiles into a native object like a list or dictionary and conduct operations on it.

@aaron-gu aaron-gu added the enhancement New feature or request label Dec 19, 2019
@aaron-gu aaron-gu self-assigned this Dec 19, 2019
@aaron-gu
Copy link
Contributor Author

This has been mostly addressed with commit d6674fc. Cut and merge are the two slowest operations now, but can still complete reasonably quickly. I'll keep this issue open to see if we can move off of pandas in the future.

@aaron-gu
Copy link
Contributor Author

Also see #11

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant