Skip to content

Instantly share code, notes, and snippets.

View liannewriting's full-sized avatar

Lianne & Justin @ Just into Data liannewriting

View GitHub Profile
@liannewriting
liannewriting / arima_model_manual.py
Created August 9, 2022 13:51
time series prediction arima model python
from statsmodels.tsa.arima.model import ARIMA
model = ARIMA(df_train, order=(2,1,0))
model_fit = model.fit()
print(model_fit.summary())
@liannewriting
liannewriting / adf_test_difference.py
Created August 9, 2022 13:47
time series prediction arima model python
adf_test = adfuller(df_train_diff)
print(f'p-value: {adf_test[1]}')
@liannewriting
liannewriting / difference_acf_pacf_plots.py
Created August 9, 2022 13:47
time series prediction arima model python
acf_diff = plot_acf(df_train_diff)
pacf_diff = plot_pacf(df_train_diff)
@liannewriting
liannewriting / difference_training.py
Created August 9, 2022 13:46
time series prediction arima model python
df_train_diff = df_train.diff().dropna()
df_train_diff.plot()
@liannewriting
liannewriting / statsmodels_adf_test.py
Created August 9, 2022 13:45
time series prediction arima model python
from statsmodels.tsa.stattools import adfuller
adf_test = adfuller(df_train)
print(f'p-value: {adf_test[1]}')
@liannewriting
liannewriting / statsmodels_acf_pacf_plots.py
Created August 9, 2022 13:41
time series prediction arima model python
from statsmodels.graphics.tsaplots import plot_acf, plot_pacf
acf_original = plot_acf(df_train)
pacf_original = plot_pacf(df_train)
@liannewriting
liannewriting / train_test_split.py
Last active August 17, 2022 15:39
time series prediction arima model python
msk = (df.index < len(df)-30)
df_train = df[msk].copy()
df_test = df[~msk].copy()
@liannewriting
liannewriting / transform_data_log.py
Created August 9, 2022 13:37
time series prediction arima model python
import numpy as np
df = np.log(df) # don't forget to transform the data back when making real predictions
df.plot()
@liannewriting
liannewriting / import_time_series_data.py
Created August 9, 2022 13:35
time series prediction arima model python
import pandas as pd
df = pd.read_csv('website_data.csv')
df.info()
df.plot()
import streamlit as st
import pandas as pd
import plotly.express as px
st.write('# Avocado Prices dashboard') #st.title('Avocado Prices dashboard')
st.markdown('''
This is a dashboard showing the *average prices* of different types of :avocado:
Data source: [Kaggle](https://www.kaggle.com/datasets/timmate/avocado-prices-2020)
''')