Add row ordering by `time` and `monitoring_location_id`, if applicable, plus... #203

ehinman · 2025-12-10T22:12:13Z

...move probably-useless-to-user database ids for specific endpoints to the end of the dataframe, so they're less likely to impede review of the data returned.

Also added a couple of tests for these changes. Note that the time component is sorted in ascending order, so earlier results show up first.

Closes #202 and is a slightly softer solution to #201, since users can request columns in the properties argument, anyway.
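For readers skimming the thread, the behavior described above can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the function name `arrange_output`, the `trailing_ids` parameter, and the choice of `daily_id` as an example id column are all assumptions, as is the sort key order (`time` first, then `monitoring_location_id`).

```python
import pandas as pd


def arrange_output(df: pd.DataFrame, trailing_ids=("daily_id",)) -> pd.DataFrame:
    """Sort rows and push database-id columns to the end (illustrative only)."""
    # Sort only by the keys that are actually present in this result.
    sort_keys = [c for c in ("time", "monitoring_location_id") if c in df.columns]
    if sort_keys:
        # Ascending order, so earlier results show up first.
        df = df.sort_values(sort_keys, ascending=True, ignore_index=True)
    # Keep every other column in its original order, then append the id columns.
    tail = [c for c in trailing_ids if c in df.columns]
    head = [c for c in df.columns if c not in tail]
    return df[head + tail]
```

Checking for each column before using it means the same helper would work for endpoints that lack a `time` or `monitoring_location_id` column, which matches the "if applicable" wording in the title.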

thodson-usgs

looks good,

thodson-usgs · 2025-12-11T22:16:09Z

dataretrieval/waterdata/utils.py

            # of the output_id (e.g. "monitoring_locations_id"), then rename
            # "id" to plural. This is pretty niche.
            else:
                plural = output_id.replace("_id", "s_id")


this line confuses me, but the code appears to work as it should

We can revisit, because this might not be necessary. I'll make an issue. Basically, every service returns a plain "id" column with the data, and that id means something different for each service. So the package prefixes the "id" column with the service name, e.g. "monitoring_location_id", "daily_id", etc.

This part of the function accounts for whether someone enters just "id" in their properties argument or enters "monitoring_locationS_id" (maybe they notice the pattern of service + id, and the sites service is called "monitoring-locationS"). If they enter "id", the resulting dataframe will have the column name "monitoring_location_id". But if they enter "monitoring_locations_id" (the literal service name, "monitoring locations", plus "id"), it will return the column name "monitoring_locations_id". I kinda doubt this will be leveraged at all, and it adds confusion.
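The naming rule described in this reply can be sketched as a small standalone function. This is a hedged illustration only: `resolve_id_column` and its signature are hypothetical and are not the function in `dataretrieval/waterdata/utils.py`; only the `output_id.replace("_id", "s_id")` expression comes from the diff under review.

```python
def resolve_id_column(properties, output_id):
    """Pick the name the service's raw "id" column would be renamed to.

    `properties` is the list of column names the user requested (or None);
    `output_id` is the singular, service-specific name such as
    "monitoring_location_id" or "daily_id".
    """
    # Plural form, e.g. "monitoring_location_id" -> "monitoring_locations_id".
    plural = output_id.replace("_id", "s_id")
    # Honor the plural spelling only if the user explicitly asked for it.
    if properties and plural in properties:
        return plural
    # Otherwise (including a plain "id" request) use the singular name.
    return output_id
```

For example, requesting `["id", "time"]` with `output_id="monitoring_location_id"` yields `"monitoring_location_id"`, while requesting `["monitoring_locations_id"]` yields `"monitoring_locations_id"`.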

add row ordering by time and site and move frivolous id columns to en…

7c66421

…d of df

ehinman requested a review from thodson-usgs December 10, 2025 22:14

thodson-usgs approved these changes Dec 11, 2025

ehinman merged commit 98f4ddd into DOI-USGS:main Dec 11, 2025
7 checks passed

ehinman mentioned this pull request Dec 11, 2025

waterdata module - option to remove "superfluous" ID columns and move stable ID columns to end of the dataframe #201

Closed
