Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more pre-commit checks (mccabe, bandit, mypy, etc) #557

Merged
merged 24 commits into from
Nov 21, 2023

Conversation

djhoese
Copy link
Member

@djhoese djhoese commented Nov 17, 2023

Let's bring pyresample into the modern age of 5 years ago with some real complexity and type checking.

  • Closes #xxxx
  • Tests added
  • Tests passed
  • Passes git diff origin/main **/*py | flake8 --diff
  • Fully documented

@djhoese djhoese self-assigned this Nov 17, 2023
@djhoese djhoese changed the title Add more pre-commit checks (mccabe, bandit, mypy, etc) and initial run Add more pre-commit checks (mccabe, bandit, mypy, etc) Nov 17, 2023
Copy link

codecov bot commented Nov 17, 2023

Codecov Report

Attention: 80 lines in your changes are missing coverage. Please review.

Comparison is base (d8f45cd) 94.13% compared to head (a7e1f25) 94.08%.

Files Patch % Lines
pyresample/utils/cf.py 68.18% 14 Missing ⚠️
pyresample/kd_tree.py 91.72% 12 Missing ⚠️
pyresample/plot.py 21.42% 11 Missing ⚠️
pyresample/area_config.py 86.36% 9 Missing ⚠️
pyresample/ewa/ewa.py 66.66% 9 Missing ⚠️
pyresample/gradient/__init__.py 0.00% 6 Missing ⚠️
pyresample/slicer.py 0.00% 5 Missing ⚠️
pyresample/bilinear/__init__.py 0.00% 3 Missing ⚠️
pyresample/bilinear/xarr.py 0.00% 2 Missing ⚠️
pyresample/future/resamplers/nearest.py 60.00% 2 Missing ⚠️
... and 5 more
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #557      +/-   ##
==========================================
- Coverage   94.13%   94.08%   -0.06%     
==========================================
  Files          84       84              
  Lines       13188    13228      +40     
==========================================
+ Hits        12415    12446      +31     
- Misses        773      782       +9     
Flag Coverage Δ
unittests 94.08% <79.48%> (-0.06%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@coveralls
Copy link

coveralls commented Nov 17, 2023

Coverage Status

coverage: 93.671% (-0.05%) from 93.72%
when pulling a7e1f25 on djhoese:precommit-mypy
into d8f45cd on pytroll:main.

@djhoese djhoese marked this pull request as ready for review November 19, 2023 21:07
@djhoese djhoese requested review from mraspaud and pnuu November 20, 2023 02:12
@djhoese
Copy link
Member Author

djhoese commented Nov 20, 2023

@mraspaud @pnuu I think this is ready. Let me know what you think. I took all of the pre-commit configuration from pre-ruff satpy. The biggest issue/difficulty was mccabe complaining about code complexity which isn't surprising given the age of this package. The biggest "offenders" were the new-ish area_config.py stuff which I blame partially on me and my supervision of Will Roberts on adding it. I think this could probably use a rewrite.

The biggest problem was the get_sample_from_neighbour_info method. This method is...a problem. I got to the point that I couldn't refactor it the way it needed without risking breaking backwards compatibility. Even though we have plenty of tests, I don't trust them to the extent that this method needs to be refactored. As a shortcut/cheat, I mostly just put chunks of code into a function and called it. This meant having to create functions that have a ton of arguments. I think in the long run this functionality will be rewritten in a resampler class in the future and to support DataArrays, dask arrays, and numpy arrays so I'm not tempted to try harder on this. That said, I think here are some avenues that someone could go down if they wanted the code to be better:

  1. A helper class: Many of the arguments passed around are kind of like a "state" that is used by multiple areas. A single class with those as instance attributes might make the code look cleaner.
  2. Remove all specialization of "multi-channel" arrays and instead always shape input into a 1 or more channel array. Then refactor things into shared functions that don't have to do with the "certainty" or "weighted" functionality so that this function could be split into 3 separate functions (maybe) with each of these 3 cases: nearest, weighted, weighted + certainty.

Copy link
Member

@mraspaud mraspaud left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot from breaking down all the complexity! I could not check that the logic was preserved in most places, but trust this is mostly refactoring, right?

And how about using ruff in place of flake8, and maybe even bandit?

@@ -111,7 +112,7 @@

# List of directories, relative to source directory, that shouldn't be searched
# for source files.
exclude_trees = []
exclude_trees: list[str] = []
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does mypy really need to run on this file?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, but I also don't mind it. I noticed in a new project that sphinx now removes all unnecessary/optional arguments from conf.py when you run the quickstart command. We could consider comparing that to what is in this conf.py and clearing unused things out.

@mraspaud
Copy link
Member

Here's the ruff config that would mimic what we have with flake8 today:

[tool.ruff]
# See https://docs.astral.sh/ruff/rules/
select = ["E", "W", "B", "D", "T10", "C90",]
line-length = 120

[tool.ruff.pydocstyle]
convention = "google"

[tool.ruff.mccabe]
# Unlike Flake8, default to a complexity level of 10.
max-complexity = 10

for bandit, that would be "S", but not I'm unsure the functionality is actually the same.

@djhoese
Copy link
Member Author

djhoese commented Nov 20, 2023

I could not check that the logic was preserved in most places, but trust this is mostly refactoring, right?

Right. There are only two places that I can think of where things were more heavily changed:

  1. This is still a refactor, but the legacy area parsing was changed to take advantage of the generator-ness of the file handle. The code is much more readable and tests pass.
  2. I changed a warning to an exception in the area_config.py parsing of YAML formats where someone types a degrees unit, but likely misspells it. It was detecting "deg" being in units but the overall word was not "degrees" or "deg". I could/should maybe just take that check out completely.

@djhoese
Copy link
Member Author

djhoese commented Nov 20, 2023

And how about using ruff in place of flake8, and maybe even bandit?

I had decided not to use it because I did not want to go down the path of all of the rules that I know you worked on for Satpy. The example config you have looks easy enough so I'll try updating to that today.

However, ruff still doesn't handle blank lines between functions/methods and it is pretty annoying to me (it has been annoying to me to review Satpy PRs where there are 3 or more lines between functions). What do you think about waiting until that is handled to switch to ruff?

for bandit, that would be "S", but not I'm unsure the functionality is actually the same.

How so?

@mraspaud
Copy link
Member

However, ruff still doesn't handle blank lines between functions/methods and it is pretty annoying to me (it has been annoying to me to review Satpy PRs where there are 3 or more lines between functions). What do you think about waiting until that is handled to switch to ruff?

oh, ok, I had missed that. That's pretty annoying indeed. We could also have ruff run in parallel with (bare) flake8 then maybe. But otherwise we can wait, yes.

for bandit, that would be "S", but not I'm unsure the functionality is actually the same.

How so?

The "S" rule for ruff is an implementation of flake8-bandit, not bandit itself. And flake8-bandit is calling bandit, so I suppose we'd better call bandit directly.

Copy link
Member

@mraspaud mraspaud left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, just a couple of comments inline

.bumpversion.cfg Show resolved Hide resolved
.pre-commit-config.yaml Outdated Show resolved Hide resolved
@djhoese
Copy link
Member Author

djhoese commented Nov 21, 2023

Ok so CodeFactor is complaining about 11 things which were all existing TODOs in these modules. Codebeat on the other hand says 0 introduced issues and 125 resolved. Not bad.

@djhoese
Copy link
Member Author

djhoese commented Nov 21, 2023

Merging this now and then I'll hope to resolve conflicts in @ghiggi's PR tonight.

@djhoese djhoese merged commit 25d0c72 into pytroll:main Nov 21, 2023
18 of 21 checks passed
@djhoese djhoese deleted the precommit-mypy branch November 21, 2023 21:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants