-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(load): add arrow endpoints #2200
Open
TheoPascoli
wants to merge
44
commits into
dev
Choose a base branch
from
feat/add-load-endpoints-with-arrow
base: dev
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+190
−1
Open
Changes from all commits
Commits
Show all changes
44 commits
Select commit
Hold shift + click to select a range
ad63883
feat: first commit
TheoPascoli 62f1b1d
feat: first commit
TheoPascoli cc01641
Merge branch 'dev' into feat/add-load-endpoints-with-arrow
TheoPascoli d73c8de
Merge branch 'dev' into feat/add-load-endpoints-with-arrow
TheoPascoli 998c04e
feat(bc): use `update_config` instead of `update_bc` for multiple upd…
MartinBelthle f4206fe
build(python): bump python version to use v3.11 (#2164)
MartinBelthle f8b0f8a
feat(ts-gen): display progress bar via websockets (#2194)
MartinBelthle ebd2df4
feat: first commit
TheoPascoli 9a3591a
feat: first commit
TheoPascoli fddf3b8
feat: first commit
TheoPascoli 582aed0
feat: first commit
TheoPascoli 72ec467
Merge branch 'feat/add-load-endpoints-with-arrow' of https://github.c…
TheoPascoli 43a5e40
feat: Refactor load management to use LoadInfoDTO
TheoPascoli 4e74852
feat: refactor load series API response model
TheoPascoli c99cdb1
feat: add support for pyarrow in load management
TheoPascoli 0a4e6f5
feat: refactor load series api response model
TheoPascoli 1b6dc2b
feat: add support for pyarrow in load management
TheoPascoli cae2930
Merge remote-tracking branch 'origin/feat/add-load-endpoints-with-arr…
TheoPascoli 1b86d9a
feat: fix whitespace issue in load management module
TheoPascoli f45f065
feat: refactor load management response
TheoPascoli 6506459
Merge remote-tracking branch 'origin/dev' into feat/add-load-endpoint…
TheoPascoli 213cbaf
feat: refactor `get_load_matrix` method to improve readability
TheoPascoli 7b5e0a0
feat: add integration test for load series endpoint
TheoPascoli 37b7108
feat: add endpoint to update load series data
TheoPascoli 1ef1ff1
feat: refactor LoadDTO and add LoadProperties model
TheoPascoli 4a234e6
feat: refactor link path handling in link management.
TheoPascoli c0a82de
Merge remote-tracking branch 'origin/dev' into feat/add-load-endpoint…
TheoPascoli e91dc71
Merge remote-tracking branch 'origin/dev' into feat/add-load-endpoint…
TheoPascoli 997803a
feat: refactor load series endpoint paths and add data conversion
TheoPascoli 255b104
Merge branch 'dev' into feat/add-load-endpoints-with-arrow
TheoPascoli aff9c83
feat: remove JSON format support for load matrix endpoints
TheoPascoli d63cb8e
feat: add docstrings and fix IO imports in load management
TheoPascoli a522a22
feat: add docstrings and fix IO imports in load management
TheoPascoli 8fe2863
Merge remote-tracking branch 'origin/dev' into feat/add-load-endpoint…
TheoPascoli 9d55b36
Merge remote-tracking branch 'origin/feat/add-load-endpoints-with-arr…
TheoPascoli 5df3960
feat: add docstrings and fix IO imports in load management
TheoPascoli 09bc218
Merge branch 'dev' into feat/add-load-endpoints-with-arrow
TheoPascoli 7e3b03a
feat: remove web considerations that were inside the business layer
TheoPascoli 6277ae0
Merge remote-tracking branch 'origin/dev' into feat/add-load-endpoint…
TheoPascoli 5fd6880
feat: set up GET endpoint that returns a arrow matrix with MatrixInde…
TheoPascoli feb957f
feat: remove useless load_model.py
TheoPascoli cb3d3f4
feat: remove useless load_model.py
TheoPascoli fe6acbd
feat: change way of dealing with the writing of the feather file
TheoPascoli 3218843
Merge branch 'dev' into feat/add-load-endpoints-with-arrow
TheoPascoli File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,47 @@ | ||
# Copyright (c) 2024, RTE (https://www.rte-france.com) | ||
# | ||
# See AUTHORS.txt | ||
# | ||
# This Source Code Form is subject to the terms of the Mozilla Public | ||
# License, v. 2.0. If a copy of the MPL was not distributed with this | ||
# file, You can obtain one at http://mozilla.org/MPL/2.0/. | ||
# | ||
# SPDX-License-Identifier: MPL-2.0 | ||
# | ||
# This file is part of the Antares project. | ||
import os | ||
import tempfile | ||
import typing as t | ||
from io import BytesIO | ||
|
||
import pandas as pd | ||
import pyarrow as pa | ||
from pyarrow import feather | ||
from pyarrow.feather import write_feather | ||
|
||
|
||
def dataframe_to_bytes(df: pd.DataFrame, metadata: t.Optional[t.Dict[str | bytes, str | bytes]]) -> bytes: | ||
table: pa.Table = pa.Table.from_pandas(df, preserve_index=False) | ||
|
||
if metadata: | ||
metadata_bytes = {str(k): str(v) for k, v in metadata.items()} | ||
schema_metadata: t.Dict[str | bytes, str | bytes] = {k: v for k, v in metadata_bytes.items()} | ||
table = table.replace_schema_metadata(schema_metadata) | ||
|
||
buffer = BytesIO() | ||
write_feather(df=table, dest=buffer) # type:ignore | ||
|
||
return buffer.getvalue() | ||
|
||
|
||
def bytes_to_dataframe(buffer: bytes) -> pd.DataFrame: | ||
data = BytesIO(buffer) | ||
table = feather.read_table(data) | ||
|
||
df = table.to_pandas() | ||
|
||
metadata = table.schema.metadata | ||
if metadata: | ||
df.metadata = {k.decode("utf8"): v.decode("utf8") for k, v in metadata.items()} | ||
|
||
return df |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,55 @@ | ||
# Copyright (c) 2024, RTE (https://www.rte-france.com) | ||
# | ||
# See AUTHORS.txt | ||
# | ||
# This Source Code Form is subject to the terms of the Mozilla Public | ||
# License, v. 2.0. If a copy of the MPL was not distributed with this | ||
# file, You can obtain one at http://mozilla.org/MPL/2.0/. | ||
# | ||
# SPDX-License-Identifier: MPL-2.0 | ||
# | ||
# This file is part of the Antares project. | ||
|
||
import typing as t | ||
|
||
import pandas as pd | ||
|
||
from antarest.study.model import MatrixIndex, Study | ||
from antarest.study.storage.rawstudy.model.filesystem.matrix.input_series_matrix import InputSeriesMatrix | ||
from antarest.study.storage.storage_service import StudyStorageService | ||
from antarest.study.storage.utils import get_start_date | ||
|
||
LOAD_PATH = "input/load/series/load_{area_id}" | ||
matrix_columns = ["ts-0"] | ||
|
||
|
||
class LoadManager: | ||
def __init__(self, storage_service: StudyStorageService) -> None: | ||
self.storage_service = storage_service | ||
|
||
def get_load_matrix(self, study: Study, area_id: str) -> t.Tuple[pd.DataFrame, t.Dict[str | bytes, str | bytes]]: | ||
file_study = self.storage_service.get_storage(study).get_raw(study) | ||
load_path = LOAD_PATH.format(area_id=area_id).split("/") | ||
|
||
node = file_study.tree.get_node(load_path) | ||
|
||
if not isinstance(node, InputSeriesMatrix): | ||
raise TypeError(f"Expected node of type 'InputSeriesMatrix', but got '{type(node).__name__}'") | ||
|
||
matrix_data = InputSeriesMatrix.parse(node, return_dataframe=True) | ||
|
||
matrix_df = t.cast(pd.DataFrame, matrix_data) | ||
matrix_df.columns = matrix_df.columns.map(str) | ||
|
||
matrix_df.columns = pd.Index(matrix_columns) | ||
|
||
matrix_index: MatrixIndex = get_start_date(file_study) | ||
|
||
metadata: t.Dict[str | bytes, str | bytes] = { | ||
"start_date": str(matrix_index.start_date), | ||
"steps": str(matrix_index.steps), | ||
"first_week_size": str(matrix_index.first_week_size), | ||
"level": str(matrix_index.level), | ||
} | ||
|
||
return matrix_df, metadata |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,50 @@ | ||
# Copyright (c) 2024, RTE (https://www.rte-france.com) | ||
# | ||
# See AUTHORS.txt | ||
# | ||
# This Source Code Form is subject to the terms of the Mozilla Public | ||
# License, v. 2.0. If a copy of the MPL was not distributed with this | ||
# file, You can obtain one at http://mozilla.org/MPL/2.0/. | ||
# | ||
# SPDX-License-Identifier: MPL-2.0 | ||
# | ||
# This file is part of the Antares project. | ||
from io import BytesIO | ||
|
||
import pytest | ||
from starlette.testclient import TestClient | ||
|
||
from antarest.study.business.arrow_utils import bytes_to_dataframe | ||
from tests.integration.prepare_proxy import PreparerProxy | ||
|
||
|
||
@pytest.mark.unit_test | ||
class TestLoad: | ||
@pytest.mark.parametrize("study_type", ["raw", "variant"]) | ||
def test_load(self, client: TestClient, user_access_token: str, study_type: str) -> None: | ||
client.headers = {"Authorization": f"Bearer {user_access_token}"} # type: ignore | ||
|
||
preparer = PreparerProxy(client, user_access_token) | ||
study_id = preparer.create_study("foo", version=880) | ||
|
||
if study_type == "variant": | ||
study_id = preparer.create_variant(study_id, name="Variant 1") | ||
|
||
area1_id = preparer.create_area(study_id, name="Area1")["id"] | ||
|
||
# Test simple get ARROW | ||
|
||
res = client.get(f"/v1/studies/{study_id}/{area1_id}/load/series") | ||
assert res.status_code == 200 | ||
assert res.headers["content-type"] == "application/vnd.apache.arrow.file" | ||
|
||
df = bytes_to_dataframe(res.content) | ||
|
||
column_name = ["ts-0"] | ||
assert column_name == list(df.columns) | ||
assert df.metadata == { | ||
"first_week_size": "7", | ||
"level": "hourly", | ||
"start_date": "2018-01-01 00:00:00", | ||
"steps": "8760", | ||
} |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Generally speaking for the other matrices, what is your vision here ? For hydro for instance, we want to add the code inside the HydroManager or do we want to create a specific manager on the side ? I personally prefer the 1st option