EASE-ReD

CMPT 353 - E100 SPRING 2024 Final Project

An exploratory analysis to investigate the potential correlation between restaurant cuisine and the ethnic demographics of the local population.

Group

Name	Student Number	Git ID
Heorhii Shramko	301428235	ShayGeko
Eunsong Koh	301549157	eunsongkoh
Tianyu Liu	301249861	tla109

Prerequisties

To run this project, Python3 and the following libraries must be installed:

Installing Python3:

Download and install Python3 from the official website.

Installing Required Libraries:

Pandas
Numpy
Pytorch
Matplotlib
Sentence_transformers
Pyspark
Tqdm
Sklearn
Dask
shutil
PyYaml

You can install the required Python libraries using pip, Python's package installer. Open a terminal or command prompt and execute the following commands:

pip3 install torch pandas numpy matplotlib pyyaml tqdm scikit-learn dask

Run

Step 1. Clone the Repository

git clone [email protected]:ShayGeko/EASE-ReD.git

cd ProjectTourOSM

Step 2. (Optional) Generate Embeddings:

Gets the data from ./bingMaps/restaurantCategory/ and produce ./embeddings/pca_category_bing_embeddings.csv
and ./embeddingscategory_bing_embeddings.csv

python3 create_embeddings.py

Step 3. Train on the Embeddings

Go to configs/ce_pca_category.yml and increment the counter in the name
e.g. name: 'ce-category-embedding-1' -> name: 'ce-category-embedding-2'
From the root directory:

python3 train.py configs/ce_pca_category.yml

will train with CrossEntropy loss on the PCA'd embeddings

If there was a problem with embedding generation (even though there shouldnt be 🙏), you can use the other embedding file for names instead of categories. Just change the config file in Step 3 from ce_pca_category.yml to ce_pca_name.yml

Then one can observe results in under experiments/<experiment name from config file>/ The predictions are stored every 1000 epochs under visuals/ and the loss is plotted iteratively in loss.png

Data Visualizations

From the root directory:
python3 visualize.py <experiment name from config file>

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
.idea		.idea
MAE_data		MAE_data
configs		configs
data		data
experiments		experiments
model		model
prediction		prediction
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
coordinates.csv		coordinates.csv
demographics_predictor.ipynb		demographics_predictor.ipynb
notes.txt		notes.txt
output.txt		output.txt
requirements.txt		requirements.txt
secrets.json		secrets.json
train.py		train.py
tukey.png		tukey.png
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EASE-ReD

Group

Prerequisties

Installing Python3:

Installing Required Libraries:

Run

Step 1. Clone the Repository

Step 2. (Optional) Generate Embeddings:

Step 3. Train on the Embeddings

Data Visualizations

License

About

Releases

Packages

Contributors 3

Languages

License

ShayGeko/EASE-ReD

Folders and files

Latest commit

History

Repository files navigation

EASE-ReD

Group

Prerequisties

Installing Python3:

Installing Required Libraries:

Run

Step 1. Clone the Repository

Step 2. (Optional) Generate Embeddings:

Step 3. Train on the Embeddings

Data Visualizations

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages