Feature branch: Rename core + output assets to match new naming protocols #2818

Merged 111 commits on Dec 16, 2023
Commits
e0ed4f2
Rename static tables
e-belfer Aug 30, 2023
5f815b3
Rename Census DP1 assets
e-belfer Sep 1, 2023
8da9db6
Test doc fix
e-belfer Sep 1, 2023
bb75aa1
Update core table names for EIA 860, 923, harvested tables, FERC1, code
e-belfer Sep 7, 2023
2bb53f5
Fix integration tests
e-belfer Sep 7, 2023
0cd5e9d
Fix alembic
e-belfer Sep 7, 2023
8790131
Rename 714, 861, epacems
e-belfer Sep 12, 2023
b851fe7
update tests and rest of assets
e-belfer Sep 13, 2023
9d7996d
Resolve merge conflict
e-belfer Sep 13, 2023
2232577
Fix validation tests
e-belfer Sep 13, 2023
7e7303a
Rename ferc output assets
bendnorman Sep 13, 2023
2fcf9f1
Merge branch 'rename-core-assets' into rename-ferc-output-assets
bendnorman Sep 13, 2023
2ee75b2
Rename denorm_cash_flow_ferc1 and remove leading underscore from cros…
bendnorman Sep 13, 2023
ec426d6
Merge branch 'dev' into rename-core-assets
e-belfer Sep 14, 2023
ada8b01
Rename a missing ferc output table and add migration
bendnorman Sep 14, 2023
3417344
Merge branch 'rename-core-assets' into rename-ferc-output-assets
e-belfer Sep 14, 2023
62f0f50
Rename EIA denorm assets
bendnorman Sep 15, 2023
00ce1e9
Recreate ferc rename migration
bendnorman Sep 15, 2023
34dba80
Add docs cross ref fix for intermediate assets
bendnorman Sep 15, 2023
ac79bd5
Resolve small denorm EIA rename issues
bendnorman Sep 17, 2023
ce16bbd
Clean up notebooks
e-belfer Sep 18, 2023
8a4e6ea
Apply naming convention to allocate generation fuel assets
bendnorman Sep 18, 2023
0a912e4
Merge pull request #2856 from catalyst-cooperative/rename-ferc-output…
bendnorman Sep 18, 2023
6b14304
Fix a missing gen fuel asset name in PudlTabl
bendnorman Sep 18, 2023
325fb52
Merge branch 'rename-core-assets' into rename-eia-output-assets
bendnorman Sep 18, 2023
a2042de
Update migrations post ferc1 output rename merge
bendnorman Sep 19, 2023
8b468db
Merge pull request #2858 from catalyst-cooperative/rename-eia-output-…
bendnorman Sep 19, 2023
5f182f9
Merge branch 'rename-core-assets' into rename-allocate-gen-fuel-assets
e-belfer Sep 20, 2023
77a16f5
Update contributor facing documentation with new asset naming convent…
bendnorman Sep 20, 2023
4d5b57d
Add new naming convention to user facing documentation
bendnorman Sep 20, 2023
efb2bbd
Correct allocate-get-fuel down revision
bendnorman Sep 20, 2023
239eb4d
Apply new naming convention to ferc714 respondents, hourly demand and…
bendnorman Sep 21, 2023
09a876d
Merge pull request #2865 from catalyst-cooperative/rename-allocate-ge…
e-belfer Sep 21, 2023
5ebedcd
Fix refs to renamed tables in release notes
bendnorman Sep 21, 2023
6ffe6a5
Rename ferc714 and eia861 output tables in integration tests
bendnorman Sep 21, 2023
d257e52
Merge branch 'rename-core-assets' into rename-annualized-respondents-…
e-belfer Sep 21, 2023
97149cb
Add missing balance authority fk migration
bendnorman Sep 25, 2023
6a5411a
Rename out_ferc714__fipsified_respondents to out_ferc714__respondents…
bendnorman Sep 26, 2023
4d256ec
Respond to first round of Austen's comments
bendnorman Sep 26, 2023
9d30977
Merge pull request #2882 from catalyst-cooperative/rename-annualized-…
bendnorman Sep 26, 2023
ef4b5ad
Merge branch 'rename-core-assets' into create-naming-convention-docs
bendnorman Sep 26, 2023
1a9028d
Update rename-core-assets and clarify raw asset sentence
bendnorman Sep 26, 2023
32dc9ac
Restrict astroid version to avoid random autoapi error
bendnorman Sep 26, 2023
ee23cba
Merge branch 'dev' into rename-core-assets
bendnorman Sep 27, 2023
d8884c2
Merge branch 'dev' into rename-core-assets
bendnorman Sep 28, 2023
765c420
Reset migrations and fix old table refs in docs
bendnorman Sep 28, 2023
7a7a441
Fix names of inputs to exploded tables and xbrl calculation fixes
bendnorman Sep 28, 2023
1aa5116
Rename mcoe and ppl assets
bendnorman Sep 29, 2023
01d3c73
Merge branch 'rename-core-assets' into rename-mcoe-assets
bendnorman Sep 29, 2023
f231452
Fix small ppl migration issue
bendnorman Sep 29, 2023
7b7dba1
Format and sort intermediate resource name cross refs in data dictionary
bendnorman Oct 2, 2023
50cea89
Add upstream mcoe assets back to metadata
bendnorman Oct 2, 2023
791a70b
Update stragler PudlTabl method name
bendnorman Oct 3, 2023
9f578b3
Add frequency to ppl asset name and some clean up
bendnorman Oct 5, 2023
1b5100e
Merge pull request #2904 from catalyst-cooperative/rename-mcoe-assets
bendnorman Oct 6, 2023
8d2ab9a
Merge branch 'dev' into rename-core-assets
cmgosnell Oct 31, 2023
c63dd8f
rename six of the non-contreversial FERC1 tables (core + out)
cmgosnell Oct 31, 2023
e8db0ad
initial rename of the FERC1 core and out tables
cmgosnell Nov 1, 2023
bb088af
add db migration
cmgosnell Nov 1, 2023
8d3b058
rename the ferc1 transformer classes in line with new table names
cmgosnell Nov 1, 2023
797d40e
Merge branch 'rename-core-assets' into create-naming-convention-docs
bendnorman Nov 1, 2023
33fab91
Incorporate some docs changes from #2912
bendnorman Nov 1, 2023
c5fb34f
FINAL FINAL rename of ferc assets
cmgosnell Nov 3, 2023
fc7de0e
ooooops remove the eia860m extraction edit bc that was not supposed t…
cmgosnell Nov 3, 2023
c2af359
Merge branch 'dev' into rename-core-assets
cmgosnell Nov 3, 2023
50e3eef
Merge branch 'rename-core-assets' into create-naming-convention-docs
bendnorman Nov 6, 2023
0c3b9ae
Merge pull request #2995 from catalyst-cooperative/rename-ferc1-assets
bendnorman Nov 6, 2023
10111e4
Remove README.rst from index.rst and move intro content to index
bendnorman Nov 7, 2023
85c6fe3
Add deprecation warnings to PudlTabl and add minor naming docs updates
bendnorman Nov 8, 2023
d61005d
Rename heat_rate_mmbtu_mwh -> heat_rate_mmbtu_mwh_by_unit
bendnorman Oct 4, 2023
53e2f2d
Rename heat rate mmbtu mwh to follow existing naming convention
bendnorman Oct 4, 2023
479ec7f
Remove PudlTabl removal data and make assn table name sources alphabe…
bendnorman Nov 8, 2023
c329804
Explain why CEMS is stored as parquet
bendnorman Nov 8, 2023
d8c01da
Rename heat_rate_mmbtu_mwh_eia/ferc1 columns to unit_heat_rate_mmbtu_…
bendnorman Nov 8, 2023
afaa449
Remove unused ppe_cols_to_grab variable
bendnorman Nov 8, 2023
53d5618
Merge pull request #3028 from catalyst-cooperative/create-renaming-re…
bendnorman Nov 9, 2023
f60592f
Make association asset names more consistent
bendnorman Nov 9, 2023
cb9b188
Merge pull request #2874 from catalyst-cooperative/create-naming-conv…
bendnorman Nov 10, 2023
19a9e7a
Merge branch 'rename-core-assets' into rename-assn-assets
bendnorman Nov 10, 2023
1d2d71c
Add association assset naming convention to docs
bendnorman Nov 10, 2023
0f90efa
Resolve migration issues with unit heat rate column
bendnorman Nov 15, 2023
578a033
Merge pull request #3035 from catalyst-cooperative/rename-assn-assets
bendnorman Nov 15, 2023
46a83b6
Merge branch 'dev' into rename-core-assets
bendnorman Nov 15, 2023
4a2be6a
Update conda-lock.yml and rendered conda environment files.
bendnorman Nov 15, 2023
f82b56e
Merge branch 'rename-core-assets' into rename-heat-rate-mmbtu-mwh-column
bendnorman Nov 15, 2023
7ef9c70
Recreate heat rate migration revision
bendnorman Nov 15, 2023
1396ad8
Merge pull request #3029 from catalyst-cooperative/rename-heat-rate-m…
bendnorman Nov 16, 2023
1e32e57
Merge branch 'dev' into rename-core-assets
bendnorman Nov 16, 2023
0fb7b9f
Use pudl_sqlite_io_manager for fuel_cost_by_generator assets
bendnorman Nov 17, 2023
3528935
Merge branch 'dev' into rename-core-assets
bendnorman Nov 17, 2023
271ffc3
Update conda-lock.yml and rendered conda environment files.
bendnorman Nov 17, 2023
9db6ec2
Checkout lock files from dev
bendnorman Nov 30, 2023
d27c0ca
Merge branch 'dev' into rename-core-assets
bendnorman Dec 1, 2023
77712e6
Update conda-lock.yml and rendered conda environment files.
bendnorman Dec 1, 2023
f07f0a5
Merge branch 'dev' into rename-core-assets
bendnorman Dec 1, 2023
3237907
Merge branch 'rename-core-assets' of github.com:catalyst-cooperative/…
bendnorman Dec 1, 2023
1928c14
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 1, 2023
5e85257
Remove intro.rst and update ferc s3 urls again
bendnorman Dec 1, 2023
e86daad
Merge branch 'rename-core-assets' of github.com:catalyst-cooperative/…
bendnorman Dec 1, 2023
68f2ec5
Merge branch 'dev' into rename-core-assets
bendnorman Dec 4, 2023
24968f4
Update conda-lock.yml and rendered conda environment files.
bendnorman Dec 4, 2023
41c4415
Merge branch 'dev' into rename-core-assets
bendnorman Dec 13, 2023
660eaff
Remove some old table names from metaddata
bendnorman Dec 14, 2023
aaa99ee
Update conda-lock.yml and rendered conda environment files.
bendnorman Dec 14, 2023
e17bd42
Merge branch 'dev' into rename-core-assets
bendnorman Dec 14, 2023
1ad3830
Merge branch 'rename-core-assets' of github.com:catalyst-cooperative/…
bendnorman Dec 14, 2023
be7e5c2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 14, 2023
2414de7
Remove ref to non existant doc page, remove files no longer in dev
bendnorman Dec 14, 2023
4ecfc86
Merge branch 'dev' into rename-core-assets
bendnorman Dec 15, 2023
9f3d293
Merge branch 'dev' into rename-core-assets
bendnorman Dec 15, 2023
9544618
Merge branch 'dev' into rename-core-assets
bendnorman Dec 15, 2023
2 changes: 1 addition & 1 deletion devtools/data-release.sh
Review comment (Member):
This file has been deleted from dev. Do we know why it's showing up here now?

Reply (Member):
Yeah that is strange. I'll look into it.

@@ -57,7 +57,7 @@ gsutil -m cp "$GCS_URL/*.sqlite" "$GCS_URL/ferc*_xbrl_*.json" $RELEASE_DIR/pudl_

echo "Copying Parquet datasets..."
mkdir -p $RELEASE_DIR/pudl_data/parquet/epacems
-gsutil -m cp -r "$GCS_URL/hourly_emissions_epacems/*" $RELEASE_DIR/pudl_data/parquet/epacems
+gsutil -m cp -r "$GCS_URL/core_epacems__hourly_emissions/*" $RELEASE_DIR/pudl_data/parquet/epacems

# Save the Docker image as a tarball so it can be archived with the data:
echo "Saving Docker image: $DOCKER_IMAGE"
2 changes: 1 addition & 1 deletion devtools/debug-eia-etl.ipynb
@@ -263,7 +263,7 @@
"outputs": [],
"source": [
"%%time\n",
-"asset_key = \"fuel_receipts_costs_eia923\"\n",
+"asset_key = \"core_eia923__monthly_fuel_receipts_costs\"\n",
"df = defs.load_asset_value(AssetKey(asset_key))\n",
"\n",
"df.head()"
16 changes: 8 additions & 8 deletions devtools/debug-ferc1-etl.ipynb
@@ -99,7 +99,7 @@
},
"outputs": [],
"source": [
-"ferc1_xbrl_raw_dfs[\"fuel_ferc1\"][\"duration\"].report_year"
+"ferc1_xbrl_raw_dfs[\"core_ferc1__yearly_fuel\"][\"duration\"].report_year"
]
},
{
@@ -206,7 +206,7 @@
"metadata": {},
"outputs": [],
"source": [
-"table_name = \"other_regulatory_liabilities_ferc1\"\n",
+"table_name = \"core_ferc1__yearly_other_regulatory_liabilities\"\n",
"TRANSFORMER = transformers[table_name] # add a table here"
]
},
@@ -326,8 +326,8 @@
"source": [
"transformed_tables = {}\n",
"for table_name, transformer in transformers.items():\n",
-" if table_name == \"plants_steam_ferc1\":\n",
-" # plants_steam_ferc1 is a special case. It depends on the transformed fuel_ferc1 table.\n",
+" if table_name == \"core_ferc1__yearly_plants_steam\":\n",
+" # core_ferc1__yearly_plants_steam is a special case. It depends on the transformed core_ferc1__yearly_fuel table.\n",
" continue\n",
" transformed_tables[transformer.table_id.value] = transformer.transform(\n",
" raw_dbf=ferc1_dbf_raw_dfs[transformer.table_id.value],\n",
@@ -345,13 +345,13 @@
},
"outputs": [],
"source": [
-"# Handle special case for \"plants_steam_ferc1\"\n",
-"transformer = transformers[\"plants_steam_ferc1\"]\n",
+"# Handle special case for \"core_ferc1__yearly_plants_steam\"\n",
+"transformer = transformers[\"core_ferc1__yearly_plants_steam\"]\n",
"transformed_tables[transformer.table_id.value] = transformer.transform(\n",
" raw_dbf=ferc1_dbf_raw_dfs[transformer.table_id.value],\n",
" raw_xbrl_instant=ferc1_xbrl_raw_dfs[transformer.table_id.value][\"instant\"],\n",
" raw_xbrl_duration=ferc1_xbrl_raw_dfs[transformer.table_id.value][\"duration\"],\n",
-" transformed_fuel=transformed_tables[\"fuel_ferc1\"],\n",
+" transformed_fuel=transformed_tables[\"core_ferc1__yearly_fuel\"],\n",
")"
]
}
@@ -372,7 +372,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
-"version": "3.11.3"
+"version": "3.11.5"
}
},
"nbformat": 4,
75 changes: 70 additions & 5 deletions devtools/inspect-assets.ipynb
@@ -50,10 +50,61 @@
"\n",
"from pudl.etl import defs\n",
"\n",
-"asset_key = \"raw_generator_existing_eia860\"\n",
+"asset_key = \"exploded_balance_sheet_assets_ferc1\"\n",
"df = defs.load_asset_value(AssetKey(asset_key))\n",
"\n",
-"df.head()"
+"#df[df.row_type_xbrl == \"correction\"].xbrl_factoid.value_counts()\n",
+"#df[(df.xbrl_factoid.isin([\"operation_expense\", \"maintenance_expense\"]))&(df.rel_diff.notnull())&(df.rel_diff!=0)].sort_values(['utility_id_ferc1', 'report_year', 'xbrl_factoid', 'rel_diff']).head(50)"
+]
+},
+{
+"cell_type": "code",
+"execution_count": null,
+"id": "b2d99594",
+"metadata": {},
+"outputs": [],
+"source": [
+"df[(df.xbrl_factoid==\"accumulated_depreciation\")&(df.plant_status==\"in_service\")&(df.plant_function==\"total\")]"
+]
+},
+{
+"cell_type": "code",
+"execution_count": null,
+"id": "467111b1",
+"metadata": {},
+"outputs": [],
+"source": [
+"df[df.xbrl_factoid.isin(factoids)&(df.utility_id_ferc1==9)&(df.report_year==1998)]"
+]
+},
+{
+"cell_type": "code",
+"execution_count": null,
+"id": "c6f7427a",
+"metadata": {},
+"outputs": [],
+"source": [
+"factoids = ['distribution_maintenance_expense_electric',\n",
+" 'hydraulic_power_generation_maintenance_expense',\n",
+" 'maintenance_of_general_plant',\n",
+" 'nuclear_power_generation_maintenance_expense',\n",
+" 'other_power_generation_maintenance_expense',\n",
+" 'regional_market_maintenance_expense',\n",
+" 'steam_power_generation_maintenance_expense',\n",
+" 'transmission_maintenance_expense_electric']"
+]
+},
+{
+"cell_type": "code",
+"execution_count": null,
+"id": "951b718d",
+"metadata": {},
+"outputs": [],
+"source": [
+"asset_key = \"calculation_components_xbrl_ferc1\"\n",
+"calcs = defs.load_asset_value(AssetKey(asset_key))\n",
+"\n",
+"calcs[(calcs.xbrl_factoid_parent == \"accumulated_depreciation\")].head(50)"
]
},
{
@@ -77,10 +128,24 @@
"\n",
"from pudl.etl import defs\n",
"\n",
-"asset_key = \"fuel_receipts_costs_eia923\"\n",
+"asset_key = \"emissions_unit_ids_epacems\"\n",
"df = defs.load_asset_value(AssetKey(asset_key))\n",
"\n",
-"df.head()"
+"df"
+]
+},
+{
+"cell_type": "code",
+"execution_count": null,
+"id": "9f0d118b",
+"metadata": {},
+"outputs": [],
+"source": [
+"from pudl.output.epacems import epacems\n",
+"\n",
+"test_epacems = epacems(states = [\"ID\"], years = [2022])\n",
+"\n",
+"test_epacems[test_epacems.operating_datetime_utc>=\"2022-01-04\"].head(40)"
]
}
],
@@ -100,7 +165,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
-"version": "3.10.10"
+"version": "3.11.5"
}
},
"nbformat": 4,
2 changes: 1 addition & 1 deletion devtools/python-output-table-conversion-debug.ipynb
@@ -109,7 +109,7 @@
"@asset(io_manager_key=\"pudl_sqlite_io_manager\", compute_kind=\"Python\")\n",
"def denorm_utilities_eia860(\n",
" utilities_entity_eia: pd.DataFrame,\n",
-" utilities_eia860: pd.DataFrame,\n",
+" core_eia860__scd_utilities: pd.DataFrame,\n",
" utilities_eia: pd.DataFrame,\n",
" utilities_eia: pd.DataFrame,\n",

Review comment (Member):

Should utilities_entity_eia be core_eia__entity_utilities and utilities_eia be core_pudl__assn_utilities_eia?
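
One way to catch leftover pre-rename input names like the ones asked about here is to compare a function's parameter names against a rename table. The sketch below is only an illustration: `RENAMES` lists old-to-new pairs that are visible elsewhere in this diff (it is not exhaustive), and `stale_params` and `example_asset` are hypothetical names, not part of PUDL. The two names raised in the comment are left out of the map because this diff does not confirm their replacements.

```python
import inspect

# Old -> new names copied from renames visible in this diff (not exhaustive).
RENAMES = {
    "utilities_eia860": "core_eia860__scd_utilities",
    "plants_entity_eia": "core_eia__entity_plants",
    "fuel_receipts_costs_eia923": "core_eia923__monthly_fuel_receipts_costs",
    "boiler_fuel_eia923": "core_eia923__monthly_boiler_fuel",
    "hourly_emissions_epacems": "core_epacems__hourly_emissions",
    "fuel_ferc1": "core_ferc1__yearly_fuel",
    "plants_steam_ferc1": "core_ferc1__yearly_plants_steam",
}


def stale_params(func, renames=RENAMES):
    """Return {old_name: new_name} for parameters still using a pre-rename name."""
    params = inspect.signature(func).parameters
    return {name: renames[name] for name in params if name in renames}


def example_asset(utilities_entity_eia, utilities_eia860, utilities_eia):
    """Hypothetical stand-in for an asset function whose inputs were not renamed."""


# Only the parameter that appears in RENAMES is reported.
print(stale_params(example_asset))
```

Extending `RENAMES` with the remaining old table names would let the same check flag the two parameters in question once their new names are settled.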

"):\n",
" ... # joining logic\n",
2 changes: 1 addition & 1 deletion docs/data_access.rst
@@ -83,7 +83,7 @@ AWS CLI, or programmatically via the S3 API. They can also be downloaded directl
HTTPS using the following links:

* `PUDL SQLite DB <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/dev/pudl.sqlite>`__
-* `EPA CEMS Hourly Emissions Parquet (1995-2021) <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/dev/hourly_emissions_epacems.parquet>`__
+* `EPA CEMS Hourly Emissions Parquet (1995-2021) <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/dev/core_epacems__hourly_emissions.parquet>`__
* `Census DP1 SQLite DB (2010) <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/dev/censusdp1tract.sqlite>`__

* Raw FERC Form 1:
8 changes: 4 additions & 4 deletions docs/dev/annual_updates.rst
@@ -338,10 +338,10 @@ A. FERC 1 & EIA Plants & Utilities
B. Missing EIA Plant Locations from CEMS
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
**6.B.1)** If there are any plants that appear in the EPA CEMS dataset that do not
-appear in the ``plants_entity_eia`` table, or that are missing latitude and longitude
-values, you'll get a warning when you try and materialize the ``epacamd`` asset group in
-Dagster. You'll need to manually compile the missing information and add it to
-``src/pudl/package_data/epacems/additional_epacems_plants.csv`` to enable accurate
+appear in the ``core_eia__entity_plants`` table, or that are missing latitude and
+longitude values, you'll get a warning when you try and materialize the ``core_epacamd``
+asset group in Dagster. You'll need to manually compile the missing information and add
+it to ``src/pudl/package_data/epacems/additional_epacems_plants.csv`` to enable accurate
adjustment of the EPA CEMS timestamps to UTC. Using the Plant ID from the warning, look
up the plant coordinates in the
`EPA FACT API <https://www.epa.gov/airmarkets/field-audit-checklist-tool-fact-api>`__.
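
The check described in step 6.B.1 can be sketched in plain Python. This is a minimal illustration, not the logic PUDL itself runs: the function name `plants_needing_manual_entry` and the shape of `plant_coords` (plant ID mapped to a latitude/longitude pair) are assumptions made for the example.

```python
import math


def plants_needing_manual_entry(cems_plant_ids, plant_coords):
    """Return sorted CEMS plant IDs that are unknown or lack usable coordinates.

    plant_coords is a hypothetical mapping of plant ID -> (latitude, longitude),
    standing in for the contents of the plant entity table.
    """

    def has_coords(plant_id):
        lat, lon = plant_coords.get(plant_id, (None, None))
        return all(v is not None and not math.isnan(v) for v in (lat, lon))

    return sorted(pid for pid in set(cems_plant_ids) if not has_coords(pid))


# Toy data: plant 2 is missing a latitude, plant 4 is not in the table at all.
plant_coords = {1: (40.0, -105.0), 2: (None, -100.0), 3: (35.0, -90.0)}
cems_ids = [1, 2, 4, 4]

print(plants_needing_manual_entry(cems_ids, plant_coords))
```

The IDs this returns are the ones that would need a row in ``additional_epacems_plants.csv``.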
2 changes: 1 addition & 1 deletion docs/dev/dev_setup.rst
@@ -266,7 +266,7 @@ store the path to this new directory:
$ set -Ux PUDL_OUTPUT /absolute/path/to/pudl_output # fish

The path stored in ``PUDL_OUTPUT`` contains all ETL outputs like
-``pudl.sqlite`` and ``hourly_emissions_epacems.parquet``.
+``pudl.sqlite`` and ``core_epacems__hourly_emissions.parquet``.

**Make sure you create separate directories for these environment variables!
It is recommended you create these directories outside of the pudl repository
4 changes: 2 additions & 2 deletions docs/dev/naming_conventions.rst
@@ -19,7 +19,7 @@ as we come across them again in maintaining the code.
* When dataframe outputs are built from multiple tables, identify the type of
information being pulled (e.g. "plants") and the source of the tables (e.g.
``eia`` or ``ferc1``). When outputs are built from a single table, simply use
-the table name (e.g. ``boiler_fuel_eia923``).
+the table name (e.g. ``core_eia923__monthly_boiler_fuel``).

.. _glossary:

@@ -89,7 +89,7 @@ Output Functions
When dataframe outputs are built from multiple tables, identify the type of
information being pulled (e.g. ``plants``) and the source of the tables (e.g.
``eia`` or ``ferc1``). When outputs are built from a single table, simply use
-the table name (e.g. ``boiler_fuel_eia923``).
+the table name (e.g. ``core_eia923__monthly_boiler_fuel``).

Table Names
-----------
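
The renamed tables in this diff appear to share one shape: a layer, a source, a double underscore, then a description (for example ``core_eia923__monthly_boiler_fuel``). The sketch below splits a name along that shape. The pattern is inferred only from the names visible in this PR, and `split_asset_name` is a hypothetical helper; the naming convention docs remain the authority on the actual rules.

```python
def split_asset_name(name):
    """Split 'layer_source__description' into (layer, source, description).

    Raises ValueError for names that do not have that shape, such as the
    pre-rename table names.
    """
    prefix, sep, description = name.partition("__")
    layer, underscore, source = prefix.partition("_")
    if not (sep and underscore and layer and source and description):
        raise ValueError(f"{name!r} does not look like layer_source__description")
    return layer, source, description


# Names taken from this diff and its commit messages.
for name in [
    "core_eia923__monthly_boiler_fuel",
    "core_epacems__hourly_emissions",
    "out_ferc714__fipsified_respondents",
]:
    print(split_asset_name(name))
```

An old-style name such as ``hourly_emissions_epacems`` has no double underscore, so the helper rejects it rather than guessing.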
2 changes: 1 addition & 1 deletion docs/dev/run_the_etl.rst
@@ -490,7 +490,7 @@ The commands above should result in a bunch of Python :mod:`logging` output
describing what the script is doing, and file outputs in the ``output``
directory within your workspace. When the ETL is complete, you
should see new files at ``output/ferc1.sqlite`` and ``output/pudl.sqlite`` as
-well as a new directory at ``output/hourly_emissions_epacems`` containing
+well as a new directory at ``output/core_epacems__hourly_emissions`` containing
nested directories named by year and state.

If you need to re-run ``ferc_to_sqlite`` and want to overwrite
8 changes: 4 additions & 4 deletions docs/intro.rst
@@ -168,10 +168,10 @@ Denormalized Outputs
We normalize the data to make storage more efficient and avoid data integrity issues,
but you may want to combine information from more than one of the tables to make the
data more readable and readily interpretable. For example, PUDL stores the name that EIA
-uses to refer to a power plant in the :ref:`plants_entity_eia` table in association with
-the plant's unique numeric ID. If you are working with data from the
-:ref:`fuel_receipts_costs_eia923` table, which records monthly per-plant fuel
-deliveries, you may want to have the name of the plant alongside the fuel delivery
+uses to refer to a power plant in the :ref:`core_eia__entity_plants` table in
+association with the plant's unique numeric ID. If you are working with data from the
+:ref:`core_eia923__monthly_fuel_receipts_costs` table, which records monthly per-plant
+fuel deliveries, you may want to have the name of the plant alongside the fuel delivery
information since it's more recognizable than the plant ID.

Rather than requiring everyone to write their own SQL ``SELECT`` and ``JOIN`` statements
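
For reference, the kind of hand-written join this paragraph describes might look like the sketch below. It builds two toy tables in an in-memory SQLite database using the new table names from this diff; the column names (`plant_id_eia`, `plant_name_eia`, `report_date`, `fuel_received_units`) and the rows are assumptions for illustration, not the real schema or data.

```python
import sqlite3

# In-memory database with toy stand-ins for the two renamed tables.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE core_eia__entity_plants (
        plant_id_eia INTEGER PRIMARY KEY,
        plant_name_eia TEXT
    );
    CREATE TABLE core_eia923__monthly_fuel_receipts_costs (
        plant_id_eia INTEGER,
        report_date TEXT,
        fuel_received_units REAL
    );
    INSERT INTO core_eia__entity_plants VALUES
        (1, 'Example Plant A'),
        (2, 'Example Plant B');
    INSERT INTO core_eia923__monthly_fuel_receipts_costs VALUES
        (1, '2022-01-01', 100.0),
        (2, '2022-01-01', 250.0),
        (1, '2022-02-01', 120.0);
    """
)

# Attach the plant name to each monthly fuel delivery record.
rows = conn.execute(
    """
    SELECT frc.report_date, frc.plant_id_eia, p.plant_name_eia, frc.fuel_received_units
    FROM core_eia923__monthly_fuel_receipts_costs AS frc
    LEFT JOIN core_eia__entity_plants AS p
        ON frc.plant_id_eia = p.plant_id_eia
    ORDER BY frc.report_date, frc.plant_id_eia
    """
).fetchall()

for row in rows:
    print(row)
```

A `LEFT JOIN` is used so that a delivery record is kept even when its plant has no entry in the entity table.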