!pip install --upgrade pixiedust

Collecting pixiedust
Collecting geojson (from pixiedust)
  Using cached https://files.pythonhosted.org/packages/f1/34/bc3a65faabce27a7faa755ab08d811207a4fc438f77ef09c229fc022d778/geojson-2.4.1-py2.py3-none-any.whl
Collecting astunparse (from pixiedust)
  Using cached https://files.pythonhosted.org/packages/2e/37/5dd0dd89b87bb5f0f32a7e775458412c52d78f230ab8d0c65df6aabc4479/astunparse-1.6.2-py2.py3-none-any.whl
Collecting lxml (from pixiedust)


import pandas as pd

import pixiedust


!wget https://raw.githubusercontent.com/IBM/visualize-data-with-python/master/data/HuntingBayou.csv


! head -n 35 HuntingBayou.csv


df = pd.read_csv('HuntingBayou.csv',sep='\t',skiprows=(0-28),header=(29))


df.head()


df = df.drop(0)


df.rename(columns={'140488_00060': 'Discharge(cfs)', '140489_00065': 'GuageHeight(feet)'}, inplace=True)


df.rename(columns={'site_no': 'site_name'}, inplace=True)
df['site_name'].replace("08075770", "HuntingBayou.csv", inplace=True)


df.drop(['agency_cd', '140488_00060_cd', '140489_00065_cd', 'tz_cd'], axis=1, inplace=True)


df['GuageHeight(feet)'] = df['GuageHeight(feet)'].convert_objects(convert_numeric=True)
df['Discharge(cfs)'] = df['Discharge(cfs)'].convert_objects(convert_numeric=True)


df['datetime'] = df['datetime'].map(lambda x: x.lstrip('2017-'))


df['latitude']='29.808611'
df['longitude']='-95.313056'


df.head()


%matplotlib inline

import matplotlib as mpl
import matplotlib.pyplot as plt


# setup line graph
plt.plot(df['datetime'],df['Discharge(cfs)'])
plt.title('Houston Flood discharge at Hunting Bayou stream gauge')
plt.ylabel('Discharge(cfs)')
plt.xlabel('datetime')
ax = plt.gca()
df.set_index('datetime')

# Only label every 20th value
ticks_to_use = df.index[::100]
# label ticks per day
dr = pd.date_range('2017-08-23', periods=9, freq='D')

## Now set the ticks and labels
ax.set_xticks(ticks_to_use)
ax.set_xticklabels(dr)
plt.xticks(rotation='vertical')

plt.show()


# setup line graph
plt.plot(df['datetime'],df['GuageHeight(feet)'])
plt.title('Houston Flood Gauge Height at Hunting Bayou stream gauge')
plt.ylabel('GuageHeight(feet)')
plt.xlabel('datetime')
ax = plt.gca()
df.set_index('datetime')

# Only label every 20th value
ticks_to_use = df.index[::100]
# label ticks per day
dr = pd.date_range('2017-08-23', periods=9, freq='D')

## Now set the ticks and labels
ax.set_xticks(ticks_to_use)
ax.set_xticklabels(dr)
plt.xticks(rotation='vertical')

plt.show()


hunting = df
display(hunting)


!wget https://raw.githubusercontent.com/IBM/visualize-data-with-python/master/data/maxFlood.csv


maxFlood = pd.read_csv('maxFlood.csv')


maxFlood


display(maxFlood)


!wget https://raw.githubusercontent.com/IBM/visualize-data-with-python/master/data/streamGauges.geojson


from pixiedust.display.app import *
from pixiedust.apps.mapboxBase import MapboxBase

@PixieApp
class HoustonDashboard(MapboxBase):
    def setup(self):
        self.mapJSONOptions = {
      "colorrampname": "Green to Purple",
      "coloropacity": "100",
      "handlerId": "mapView",
      "kind": "simple",
      "mapboxtoken": "",
      "keyFields": "latitude,longitude",
      "valueFields": "Gage_Height(feet),Discharge(cfs),date,time"
    }
        

        self.setLayers([
        {
            "name": "Houston Flooded Streets",
            "url": "https://raw.githubusercontent.com/IBM/visualize-data-with-python/master/data/houston.geojson",
            "type": "LineString"
        },
        {
            "name": "Random fictional homes",
            "url": "https://raw.githubusercontent.com/IBM/visualize-data-with-python/master/data/HoustonFloodedZips250.geojson",
            "circle-color": "rgb(0, 255, 0)"
        }
        ])
    def formatOptions(self,options):
        return ';'.join(["{}={}".format(key,value) for (key, value) in iteritems(options)])

    @route()
    def mainScreen(self):
        return """
<div class="well">
    <center><span style="font-size:x-large">Analyzing Houston Flood data with PixieDust</span></center>
</div>
<div class="row">
    <div class="form-group col-sm-2" style="padding-right:10px;">
        <div><strong>Layers</strong></div>
        {% for layer in this.layers %}
        <div class="rendererOpt checkbox checkbox-primary">
            <input type="checkbox" pd_refresh="map{{prefix}}" pd_script="self.toggleLayer({{loop.index0}})">
            <label>{{layer["name"]}}</label>
        </div>
        {%endfor%}
    </div>
    <div class="form-group col-sm-10">
        <div id="map{{prefix}}" pd_entity pd_options="{{this.formatOptions(this.mapJSONOptions)}}"/>
    </div>
</div>
"""

HoustonDashboard().run(maxFlood,runInDialog="false")


!pip install folium==0.5.0


import folium

# define the world map centered around Canada with a higher zoom level
houston_map = folium.Map(location=[29.808611, -95.313056], zoom_start=8)

# display world map
houston_map


# instantiate a feature group for the incidents in the dataframe
gauges = folium.map.FeatureGroup()

# loop through the 100 crimes and add each to the incidents feature group
for lat, lng, in zip(maxFlood.latitude, maxFlood.longitude):
    gauges.add_child(
        folium.features.CircleMarker(
            [lat, lng],
            radius=5, # define how big you want the circle markers to be
            color='yellow',
            fill=True,
            fill_color='blue',
            fill_opacity=0.6
        )
    )

# add incidents to map
houston_map.add_child(gauges)


# instantiate a feature group for the stream gauges in the dataframe
gauges = folium.map.FeatureGroup()

# loop through the stream gauges and add each to the gauges feature group
for lat, lng, in zip(maxFlood.latitude, maxFlood.longitude):
    gauges.add_child(
        folium.features.CircleMarker(
            [lat, lng],
            radius=5, # define how big you want the circle markers to be
            color='yellow',
            fill=True,
            fill_color='blue',
            fill_opacity=0.6
        )
    )

# add site_name pop-up text to each marker on the map
latitudes = list(maxFlood.latitude)
longitudes = list( maxFlood.longitude)
label = list(maxFlood.site_name)

for lat, lng, label in zip(latitudes, longitudes, label):
    folium.Marker([lat, lng], popup=label).add_to(houston_map)    
    
# add gauges to map
houston_map.add_child(gauges)

# add clickable lat and long info
houston_map.add_child(folium.LatLngPopup())


houston_map.add_child(folium.ClickForMarker(popup='My House'))


open

STSA Data Visualization With Python¶

A Jupyter Notebook used to visualize data from the Houston Flood of 2017 running on IBM Watson Studio¶

Jupyter Notebook ¶

IBM Watson Studio ¶

Python ¶

Contents¶

1.0 Install dependencies and import packages¶

1.1 Install pixiedust¶

1.2 Import the packages¶

2.0 Obtain and curate data¶

Where in Houston does flooding occur, and which specific adresses are vulnerable?¶

2.1 Search for data¶

2.2 Download and examine data¶

2.3 First, let's look at the header to the file (which I've peeked at in an editor). This gives us some info on the contents:¶

2.4 Look at the pandas dataframe¶

2.5 Do some data frame cleanup¶

2.6 Use Matplotlib to visualize data¶

Plot the Discharge against time¶

Plot the Gauge Height against time¶

2.7 Use pixiedust `display()` to explore the schema and browse the data¶

2.7.1 Select DataFrame Table icon in the display widget to see the data in tabular form¶

2.7.2 Select the chart icon to pull down and choose `line chart`. Click the `Options` button, and then for `Keys` drag and drop `datetime` and for `Values` drag and drop `Discharge`. This will display the water discharge at this stream gauge in cubic feet per second.¶

2.7.4 Click the `Options` button, and then for `Keys` drag and drop `datetime` and for `Values` drag and drop `Gauge_Height`. This will display the height of the water at this stream gauge, in feet.¶

2.8 Gather data for Max stream flows¶

We have already:¶

3.0 Create Pixie App¶

Building the PixieApp Dashboard¶

What you'll need:¶

FAQ about the code below:¶

4.0 Use Folium for mapping¶

4.1 Create map with Folium¶

4.2 Change Folium tiles¶

4.3 Add feature group¶

4.4 Add text and lat/long¶

4.5 Add ability to drop markers on-the-fly¶

Exercise: See if you can find your `Assigned House` and drop a marker at that latitude and longitude location¶

5.0 Explore more tools¶

STSA Data Visualization With Python¶

A Jupyter Notebook used to visualize data from the Houston Flood of 2017 running on IBM Watson Studio¶

Jupyter Notebook¶

IBM Watson Studio¶

Python¶

Contents¶

1.0 Install dependencies and import packages¶

1.1 Install pixiedust¶

1.2 Import the packages¶

2.0 Obtain and curate data¶

Where in Houston does flooding occur, and which specific adresses are vulnerable?¶

2.1 Search for data¶

2.2 Download and examine data¶

2.3 First, let's look at the header to the file (which I've peeked at in an editor). This gives us some info on the contents:¶

2.4 Look at the pandas dataframe¶

2.5 Do some data frame cleanup¶

2.6 Use Matplotlib to visualize data¶

Plot the Discharge against time¶

Plot the Gauge Height against time¶

2.7 Use pixiedust display() to explore the schema and browse the data¶

2.7.1 Select DataFrame Table icon in the display widget to see the data in tabular form¶

2.7.2 Select the chart icon to pull down and choose line chart. Click the Options button, and then for Keys drag and drop datetime and for Values drag and drop Discharge. This will display the water discharge at this stream gauge in cubic feet per second.¶

2.7.4 Click the Options button, and then for Keys drag and drop datetime and for Values drag and drop Gauge_Height. This will display the height of the water at this stream gauge, in feet.¶

2.8 Gather data for Max stream flows¶

We have already:¶

3.0 Create Pixie App¶

Building the PixieApp Dashboard¶

What you'll need:¶

FAQ about the code below:¶

4.0 Use Folium for mapping¶

4.1 Create map with Folium¶

4.2 Change Folium tiles¶

4.3 Add feature group¶

4.4 Add text and lat/long¶

4.5 Add ability to drop markers on-the-fly¶

Exercise: See if you can find your Assigned House and drop a marker at that latitude and longitude location¶

5.0 Explore more tools¶

Jupyter Notebook ¶

IBM Watson Studio ¶

Python ¶

2.7 Use pixiedust `display()` to explore the schema and browse the data¶

2.7.2 Select the chart icon to pull down and choose `line chart`. Click the `Options` button, and then for `Keys` drag and drop `datetime` and for `Values` drag and drop `Discharge`. This will display the water discharge at this stream gauge in cubic feet per second.¶

2.7.4 Click the `Options` button, and then for `Keys` drag and drop `datetime` and for `Values` drag and drop `Gauge_Height`. This will display the height of the water at this stream gauge, in feet.¶

Exercise: See if you can find your `Assigned House` and drop a marker at that latitude and longitude location¶