Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix Windows Script Issues #70

Closed
wants to merge 17 commits into from
Closed
Show file tree
Hide file tree
Changes from 15 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
16 changes: 9 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -70,7 +70,9 @@ Installed version of *Python3*, *Docker* and
*docker-compose v2* ([*Go here for instructions*](https://docs.docker.com/compose/install/)) or use the Binder batch in
the next section.

***Note***: We recommend giving Docker at least 8 GB of RAM (On Docker Desktop you can go under settings -> resources)
***Note***:
1) We recommend giving Docker at least 8 GB of RAM (On Docker Desktop you can go under settings -> resources)
2) If you are a Windows user, we recommend you to install the *Windows Subsystem for Linux (WSL)* and integrate docker with WSL. For WSL installation's guidance can be found [here](https://docs.microsoft.com/en-us/windows/wsl/install). We use Ubuntu on WSL for debug and testing purposes.

#### Demo correlating Uber traversals with Google popularities

Expand All @@ -87,15 +89,15 @@ You could either use the deployed example on Binder using the badge above or run
simply uses Pandas dataframes and is not connecting to a data warehouse. \
To run the demo locally, launch Docker in the background and from inside the root directory run:

Linux/Mac:
```zsh
cd kuwala/scripts && sh initialize_core_components.sh && sh run_cli.sh
```
and for Windows (Please use PowerShell or any Docker integrated terminal):
```PS
cd kuwala/scripts && sh initialize_windows.sh && cd windows && sh initialize_core_components.sh && sh run_cli.sh
cd kuwala/scripts/shell && sh initialize_core_components.sh && sh run_cli.sh
```
Or if you are a Windows User having issue to run a `shell` script, you can run a `python` script as an alternative:
IritaSee marked this conversation as resolved.
Show resolved Hide resolved

```zsh
cd kuwala/scripts/python && python3 initialize_core_components.py && python3 run_cli.py
```
*All `shell` scripts are also available in `python` script using the same file naming, inside the `/python` directory*
#### Run the data pipelines yourself

To run the pipelines yourself, please follow the instructions for the
Expand Down
18 changes: 9 additions & 9 deletions kuwala/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,14 +9,14 @@ Installed version of *Docker* and *docker-compose v2*

### Pipelines

If you want to build all containers for all pipelines, change your working directory to `./kuwala/scripts` (or move to
`./kuwala/scripts/`, run `initialize_windows.sh`, and change directory to `windows/` if you are running a Windows
machine) and run:
If you want to build all containers for all pipelines, change your working directory to `./kuwala/scripts` and run:
IritaSee marked this conversation as resolved.
Show resolved Hide resolved


```zsh
sh initialize_all_components.sh
python3 initialize_all_components.py
IritaSee marked this conversation as resolved.
Show resolved Hide resolved
```


You can also build the containers individually for single pipelines. All services are listed in the
[`./docker-compose.yml`](https://github.com/kuwala-io/kuwala/tree/master/kuwala/docker-compose.yml). Please refer to
each pipeline's `README.md` on how to run them. You can find the pipeline directories under
Expand All @@ -25,7 +25,7 @@ each pipeline's `README.md` on how to run them. You can find the pipeline direct
It would be safer to always run the build commands after pulling new code changes so that your local images
have the latest code.

Docker images will only be built once when you run the `initialize_all_components.sh` script, so new changes in the code
Docker images will only be built once when you run the `initialize_all_components.py` script, so new changes in the code
IritaSee marked this conversation as resolved.
Show resolved Hide resolved
will not reflect on your local unless you explicitly run the build commands.

Please note that the Docker runs will create data folders under `./kuwala/tmp/kuwala` that will be used for db, file
Expand All @@ -35,20 +35,20 @@ Now you can proceed to any of the pipelines' `README.md` and follow the steps to

### Core

To initialize the CLI and Jupyter notebook run within the `./kuwala/scripts` directory (or `./kuwala/scripts/windows` ):
To initialize the CLI and Jupyter notebook run within the `./kuwala/scripts` directory:
IritaSee marked this conversation as resolved.
Show resolved Hide resolved

```zsh
sh initialize_core_components.sh
python3 initialize_core_components.py
IritaSee marked this conversation as resolved.
Show resolved Hide resolved
```

To launch the CLI run:

```zsh
sh run_cli.sh
python3 run_cli.py
IritaSee marked this conversation as resolved.
Show resolved Hide resolved
```

If you only want to start the Jupyter environment run:

```zsh
sh run_jupyter_notebook.sh
python3 run_jupyter_notebook.py
IritaSee marked this conversation as resolved.
Show resolved Hide resolved
```
15 changes: 9 additions & 6 deletions kuwala/core/cli/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,20 +20,23 @@ The following pipelines can currently be selected through the CLI:

To make sure you are running the latest version of all pipelines, run from inside the root directory:

Linux/Mac:

```zsh
cd kuwala/scripts && sh initialize_all_components.sh
cd kuwala/scripts/shell && sh initialize_all_components.sh
```

Windows:
Or if you are a Windows User having issue to run a `shell` script, you can run a `python` script as an alternative:
IritaSee marked this conversation as resolved.
Show resolved Hide resolved

```zsh
cd kuwala/scripts && sh initialize_windows.sh && cd windows && sh initialize_all_components.sh
cd kuwala/scripts/python && python3 initialize_all_components.py
```

To start the CLI, run the following script from inside the `kuwala/scripts` directory and follow the instructions:
*All `shell` scripts are also available in `python` script using the same file naming, inside the `/python` directory*


To start the CLI, run the following script from inside the `kuwala/scripts/shell` directory and follow the instructions:

```zsh
sh run_cli.sh
```
```
or `python3 run_cli.py` from inside the `kuwala/scripts/python` directory
2 changes: 1 addition & 1 deletion kuwala/core/cli/src/main.py
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ def launch_jupyter_notebook():
run_command('docker-compose run --service-ports jupyter', exit_keyword='or http')

webbrowser.open('http://localhost:8888/lab/tree/kuwala/notebooks/popularity_correlation.ipynb')
print('To stop and remove all Docker containers run the "sh stop_all_containers.sh"')
print('To stop and remove all Docker containers run the "python3 stop_all_containers.py"')
IritaSee marked this conversation as resolved.
Show resolved Hide resolved


if __name__ == '__main__':
Expand Down
11 changes: 5 additions & 6 deletions kuwala/pipelines/osm-poi/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,18 +23,17 @@ To transform the standard `pbf` files, which is the file format of the OSM data,
OSM-parquetizer is based on a Git submodule which needs to be initialized first. To initialize the submodule, run from
inside the root directory:

Linux/Mac:

```zsh
cd kuwala/scripts && sh initialize_git_submodules.sh
cd kuwala/scripts/shell && sh initialize_git_submodules.sh
```

Windows:
Or if you are a Windows User having issue to run a `shell` script, you can run a `python` script as an alternative:
IritaSee marked this conversation as resolved.
Show resolved Hide resolved

```zsh
cd kuwala/scripts && sh initialize_windows.sh && cd windows && sh initialize_git_submodules.sh
cd kuwala/scripts/python && python3 initialize_git_submodules.py
```

*All `shell` scripts are also available in `python` script using the same file naming, inside the `/python` directory*

To make sure you are running the latest version of the pipeline, build the Docker images from inside the `kuwala`
directory by running:

Expand Down
12 changes: 0 additions & 12 deletions kuwala/scripts/initialize_windows.sh

This file was deleted.

14 changes: 14 additions & 0 deletions kuwala/scripts/python/build_all_containers.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
"""
build_all_containers.sh:
cd ..
docker-compose build postgres database-importer database-transformer jupyter admin-boundaries google-poi-api google-poi-pipeline google-trends osm-parquetizer osm-poi population-density'
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))
os.chdir(os.path.join(script_dir,'../../'))

rc.run_command(['docker-compose build postgres database-importer database-transformer jupyter admin-boundaries google-poi-api google-poi-pipeline google-trends osm-parquetizer osm-poi population-density'])

22 changes: 22 additions & 0 deletions kuwala/scripts/python/build_cli.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
"""
build_cli.sh:

cd ../..
pip3 install virtualenv
virtualenv -p python3 venv
source ./venv/bin/activate
pip install -r kuwala/core/cli/requirements.txt
pip install -e .
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

os.chdir(os.path.join(script_dir,'../../../'))
rc.run_command(['pip3 install virtualenv'])
rc.run_command(['virtualenv -p python3 venv'])
rc.run_command(['source ./venv/bin/activate'])
rc.run_command(['pip install -r kuwala/core/cli/requirements.txt'])
rc.run_command(['pip install -e .'])
14 changes: 14 additions & 0 deletions kuwala/scripts/python/build_jupyter_notebook.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
"""
build_jupyter_notebook.sh:
cd ..
docker-compose build jupyter

"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

os.chdir(os.path.join(script_dir,'../../'))
rc.run_command(['docker-compose build jupyter'])
12 changes: 12 additions & 0 deletions kuwala/scripts/python/build_postgres.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
"""
cd ..
docker-compose build postgres
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

os.chdir(os.path.join(script_dir,'../../'))
rc.run_command(['docker-compose build postgres'])
12 changes: 12 additions & 0 deletions kuwala/scripts/python/create_zip_archive.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
"""
cd ../..
git archive --format=zip HEAD -o kuwala.zip
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

os.chdir(os.path.join(script_dir,'../../../'))
rc.run_command(['git archive --format=zip HEAD -o kuwala.zip'])
14 changes: 14 additions & 0 deletions kuwala/scripts/python/initialize_all_components.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
"""
sh initialize_git_submodules.sh
sh build_cli.sh
sh build_all_containers.sh
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

rc.run_command(['python3 initialize_git_submodules.py'])
rc.run_command(['python3 build_cli.py'])
rc.run_command(['python3 build_all_containers.py'])
14 changes: 14 additions & 0 deletions kuwala/scripts/python/initialize_core_components.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
"""
sh build_postgres.sh
sh build_cli.sh
sh build_jupyter_notebook.sh
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

rc.run_command(['python3 build_postgres.py'])
rc.run_command(['python3 build_cli.py'])
rc.run_command(['python3 build_jupyter_notebook.py'])
12 changes: 12 additions & 0 deletions kuwala/scripts/python/initialize_git_submodules.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
"""
cd ../..
git submodule update --init --recursive
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

os.chdir(os.path.join(script_dir,'../../../'))
rc.run_command(['git submodule update --init --recursive'])
16 changes: 16 additions & 0 deletions kuwala/scripts/python/run_cli.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
"""
cd ../../
source ./venv/bin/activate
cd kuwala/core/cli
python3 src/main.py
"""

import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

os.chdir(os.path.join(script_dir,'../../../'))
rc.run_command(['source ./venv/bin/activate'])
os.chdir(os.path.join(script_dir,'../../../','kuwala/core/cli'))
rc.run_command(['python3 src/main.py'])
51 changes: 51 additions & 0 deletions kuwala/scripts/python/run_command.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,51 @@
from threading import Thread
import subprocess
import os

def run_command(command: [str], exit_keyword=None):
process = subprocess.Popen(
command,
bufsize=1,
stdout=subprocess.PIPE,
stderr=subprocess.PIPE,
universal_newlines=True,
shell=True
)
thread_result = dict(hit_exit_keyword=False)

def print_std(std, result):
while True:
line = std.readline()

if len(line.strip()) > 0:
print(line if 'Stage' not in line and '%' not in line else line.strip(), end='\r')

if exit_keyword is not None and exit_keyword in line:
result['hit_exit_keyword'] = True

break

return_code = process.poll()

if return_code is not None:
if return_code != 0:
return RuntimeError()

break

stdout_thread = Thread(target=print_std, args=(process.stdout, thread_result,), daemon=True)
stderr_thread = Thread(target=print_std, args=(process.stderr, thread_result,), daemon=True)

stdout_thread.start()
stderr_thread.start()

while stdout_thread.is_alive() and stderr_thread.is_alive():
pass

if thread_result['hit_exit_keyword']:
return process



if __name__=='__main__':
run_command()
12 changes: 12 additions & 0 deletions kuwala/scripts/python/run_jupyter_notebook.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
"""
cd ../
docker-compose run --service-ports jupyter
"""
import os
import run_command as rc

script_dir = os.path.dirname(os.path.abspath(__file__))

os.chdir(os.path.join(script_dir,'../../'))
rc.run_command(['docker-compose run --service-ports jupyter'])

14 changes: 14 additions & 0 deletions kuwala/scripts/python/stop_all_containers.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
"""
reset
docker stop $(docker ps -a -q)
docker-compose down
docker-compose rm -f
"""

import os
import run_command as rc

rc.run_command(['reset'])
rc.run_command(['docker stop $(docker ps -a -q)'])
rc.run_command(['docker-compose down'])
rc.run_command(['docker-compose rm -f'])
Original file line number Diff line number Diff line change
@@ -1,2 +1,3 @@
cd ..
cd ..
IritaSee marked this conversation as resolved.
Show resolved Hide resolved
docker-compose build postgres database-importer database-transformer jupyter admin-boundaries google-poi-api google-poi-pipeline google-trends osm-parquetizer osm-poi population-density
Loading