from IPython.display import Image
Image(filename='cli-thumbnail.png', width='80%')

World Economic Health Indicator:¶

OECD: Composite Leading Indicators (CLI)¶

https://www.economy.com/dismal/indicators/definition/oecd_cli

Established in 1961, the Organisation for Economic Co-operation and Development (OECD) is composed of 30 member nations. The OECD acts as a forum to coordinate economic policies and promote growth and stability. The OECD Composite Leading Indicators are selected from a wide range of key short-term economic indicators (170 in total, or between 5 and 10 for each country), which reflect different sectors of the economy, and provide a gauge of future economic activity.

https://data.oecd.org/leadind/composite-leading-indicator-cli.htm

S&P 500 (^GSPC)¶

https://finance.yahoo.com/quote/%5EGSPC

Direct pull:¶

https://stats.oecd.org/sdmx-json/data/DP_LIVE/OECD.CLI.../OECD?contentType=csv&detail=code&separator=comma&csv-lang=en

Google Trends¶

And just for fun, try pulling somthing from Google Trends

%matplotlib inline
import matplotlib
import matplotlib.pyplot as plt
import io, base64, os, json, re, math
import pandas as pd
import numpy as np
import datetime
import warnings
warnings.filterwarnings('ignore')

## Building a good world representation of the CLI
cli_df = pd.read_csv('DP_LIVE_19012020143246784.csv')
set(cli_df['LOCATION'])

{'CHN', 'OECD', 'RUS'}

cli_df.head()

import calendar
full_date_list = []
for tm in cli_df['TIME']:
    year = int(tm.split('-')[0])
    month = int(tm.split('-')[1])
    full_date_list.append(tm + '-' + str(calendar.monthrange(year, month)[1]))

cli_df['Date'] = full_date_list
cli_df['Date'] = pd.to_datetime(cli_df['Date'])
cli_df = cli_df[['Date', 'LOCATION', 'Value']]
cli_df = cli_df.sort_values('Date', ascending=True)
print(min(cli_df['Date']), max(cli_df['Date']))
cli_df.tail()

1961-06-30 00:00:00 2019-11-30 00:00:00

# min date for each location
for ct in set(cli_df['LOCATION']):
    print(ct)
    cli_tmp = cli_df[cli_df['LOCATION']==ct]
    print(np.min(cli_tmp['Date']))

CHN
1990-01-31 00:00:00
OECD
1961-06-30 00:00:00
RUS
1992-09-30 00:00:00

cli_df = cli_df[cli_df['Date'] >= '1992-09-30']
cli_df = cli_df[['Date', 'Value']]
cli_df = cli_df.groupby('Date').mean().reset_index()

cli_df.head()

Load S&P 500 Index for Comparison and Data Exploration¶

# load the S&P 500
sp500_raw = pd.read_csv('^GSPC.csv')
sp500_raw.tail()

sp500_raw = sp500_raw[['Date','Adj Close']]
sp500_raw['Date'] = pd.to_datetime(sp500_raw['Date'])
sp500_raw = sp500_raw.sort_values('Date', ascending=True) 
print(min(sp500_raw['Date']), max(sp500_raw['Date']))
sp500_raw.head()

1927-12-30 00:00:00 2020-01-10 00:00:00

Plot both together¶

sp500_tmp = sp500_raw.copy()
sp500_tmp = sp500_tmp[sp500_tmp['Date'] >= '1992-09-30']

fig, ax = plt.subplots(figsize=(16, 10))
plt.plot(sp500_tmp['Date'], sp500_tmp['Adj Close'], label='SP500', 
         color='green')
plt.legend(loc='lower left')
second_axis = ax.twinx()
plt.plot(cli_df['Date'], cli_df['Value'], label='CLI', 
         color='blue')
second_axis.axhline(y=100, color='gray')
plt.title('CLI-World vs SP&500')
plt.legend(loc='upper left')
plt.grid()

Playing with Google Trends¶

google_trends = pd.read_csv('google-trends-refi-rates.csv', skiprows=1)
google_trends.head()

import calendar
full_date_list = []
for tm in google_trends['Month']:
    year = int(tm.split('-')[0])
    month = int(tm.split('-')[1])
    full_date_list.append(tm + '-' + str(calendar.monthrange(year, month)[1]))

google_trends['Date'] = full_date_list
google_trends['Date'] = pd.to_datetime(google_trends['Date'])
google_trends = google_trends[['Date', 'refinance: (Worldwide)', 'interest rates: (Worldwide)']]
print(min(google_trends['Date']), max(google_trends['Date']))
google_trends.tail()

2004-01-31 00:00:00 2020-01-31 00:00:00

cli_tmp = cli_df.copy()
cli_tmp = cli_tmp[cli_tmp['Date'] >= '2004-01-31']

fig, ax = plt.subplots(figsize=(16, 10))
plt.plot(google_trends['Date'], 
         google_trends['refinance: (Worldwide)'],
         label='Refinance', 
         color='blue')

plt.legend(loc='lower left')
second_axis = ax.twinx()
plt.plot(google_trends['Date'], 
         google_trends['interest rates: (Worldwide)'], 
         label='Interest Rates', 
         color='green')

plt.legend(loc='upper left')

third_axis = ax.twinx()
plt.plot(cli_tmp['Date'], cli_tmp['Value'], label='CLI', 
         color='black')
second_axis.axhline(y=100, color='gray')
plt.title('CLI-World vs Google Trends')
plt.legend(loc='upper right')
plt.grid()

	LOCATION	INDICATOR	SUBJECT	MEASURE	FREQUENCY	TIME	Value	Flag Codes
0	CHN	CLI	AMPLITUD	LTRENDIDX	M	1990-01	96.02216	NaN
1	CHN	CLI	AMPLITUD	LTRENDIDX	M	1990-02	95.74557	NaN
2	CHN	CLI	AMPLITUD	LTRENDIDX	M	1990-03	95.52505	NaN
3	CHN	CLI	AMPLITUD	LTRENDIDX	M	1990-04	95.35284	NaN
4	CHN	CLI	AMPLITUD	LTRENDIDX	M	1990-05	95.21412	NaN

	Date	LOCATION	Value
357	2019-10-31	CHN	99.18159
1386	2019-10-31	RUS	99.69604
358	2019-11-30	CHN	99.30731
1060	2019-11-30	OECD	99.25119
1387	2019-11-30	RUS	99.75460

	Date	Value
0	1992-09-30	101.600053
1	1992-10-31	101.555863
2	1992-11-30	101.503577
3	1992-12-31	101.448503
4	1993-01-31	101.392197

	Date	Open	High	Low	Close	Adj Close	Volume
23111	2020-01-06	3217.550049	3246.840088	3214.639893	3246.280029	3246.280029	3674070000
23112	2020-01-07	3241.860107	3244.909912	3232.429932	3237.179932	3237.179932	3420380000
23113	2020-01-08	3238.590088	3267.070068	3236.669922	3253.050049	3253.050049	3720890000
23114	2020-01-09	3266.030029	3275.580078	3263.669922	3274.699951	3274.699951	3638390000
23115	2020-01-10	3281.810059	3282.989990	3260.860107	3265.350098	3265.350098	1631680402

	Date	Adj Close
0	1927-12-30	17.660000
1	1928-01-03	17.760000
2	1928-01-04	17.719999
3	1928-01-05	17.549999
4	1928-01-06	17.660000

ViralML.com

Get the "Applied Data Science Edge"!

Web Work

Hot off the Press!

Peer 6-9 Months Into the Future with the CLI - a World Economic Health Leading Indicator

Introduction

Code