Python ThreadPoolExecutor: 7-Day Crash Course

jasonb@fediverser.communick.dev · 11 months ago

Python ThreadPoolExecutor: 7-Day Crash Course

jasonb@fediverser.communick.dev · 1 year ago

Very cool! Thanks.

jasonb@fediverser.communick.dev · 1 year ago

Agreed. My git fu was weak in 2010.

jasonb@fediverser.communick.dev · 1 year ago

For sure!

jasonb@fediverser.communick.dev · 1 year ago

Thanks. I left out free()'s to get the thing working fast, but you’re right, they should be in there.

Nod, I’m leaving out early stopping so all versions are comparable.

jasonb@fediverser.communick.dev · 1 year ago

Noted, thanks. I may swing back around and add some free() calls.

jasonb@fediverser.communick.dev · 1 year ago

UPDATE: I’ve added an optimized numpy version that is as fast (actually slightly) faster than the naive c version. Yay!

jasonb@fediverser.communick.dev · 1 year ago

Good point.

I used to struggle in the same way with kaggle competitions. Whole parts of my pipeline would change, but wanted/needed to preserve each pipeline independently for ad hoc experimentation. I used parallel dirs in the git repo.

jasonb@fediverser.communick.dev · 1 year ago

The ideas is to see the progression as the code is optimized away for readability toward performance.

You’d prefer the files to be named meaningfully? Or you’d prefer to use “git diff” to see changes and have a single copy?

The risk is a new version has a good idea, but is slower. It should not be the canonical version, just a stepping stone.

jasonb@fediverser.communick.dev · 1 year ago

Can a Python genetic algorithm run as fast as C?