Wrangle and Analyze Data
I can:
-Use Anaconda to manage a programming environment
-Investigate a dataset using Python data analysis packages
-Perform the entire data analysis process on a dataset
-Gather data from multiple sources in a variety of formats
-Assess the quality and tidiness of data visually and programmatically
-Clean data using Python and pandas
-Visualize the data
I successfully completed the Data Wrangling student project in the Udacity Data Analysis Professional Nanodegree Program. In this project, I used Tweepy to query Twitter's API for data on the tweets included in the WeRateDogs Twitter archive, including each tweet's retweet count and favorite count. I wrote Python code to create an API object and used it to gather the Twitter data. After querying each tweet ID, I wrote the tweet's JSON data to a tweet_json.txt file, one tweet per line. I then read this file line by line to create a pandas DataFrame, which I assessed and cleaned.
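The gather-and-load step described above can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the function names (write_tweets, read_tweets, to_dataframe) and the fetch_tweet callable are hypothetical. With Tweepy, fetch_tweet would wrap a call on the API object that returns a tweet's JSON as a dict; here it is left as a parameter so the sketch runs without network access or credentials. The field names id, retweet_count and favorite_count are assumed to be present in each tweet's JSON.

```python
import json


def write_tweets(tweet_ids, fetch_tweet, path):
    """Write each tweet's JSON on its own line; return the IDs that failed.

    fetch_tweet is a hypothetical callable that takes a tweet ID and
    returns that tweet's JSON as a dict (for example, a thin wrapper
    around a Tweepy API object).
    """
    failed = []
    with open(path, "w", encoding="utf-8") as f:
        for tweet_id in tweet_ids:
            try:
                tweet = fetch_tweet(tweet_id)
            except Exception:
                # Deleted or protected tweets cannot be fetched; record and skip.
                failed.append(tweet_id)
                continue
            f.write(json.dumps(tweet) + "\n")
    return failed


def read_tweets(path):
    """Read the file line by line and keep only the fields of interest."""
    rows = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue  # ignore blank lines
            tweet = json.loads(line)
            rows.append({
                "tweet_id": tweet["id"],
                "retweet_count": tweet["retweet_count"],
                "favorite_count": tweet["favorite_count"],
            })
    return rows


def to_dataframe(rows):
    """Build a pandas DataFrame from the parsed rows."""
    import pandas as pd  # imported here so the functions above need only the standard library
    return pd.DataFrame(rows)
```

A typical use would be to call write_tweets with the tweet IDs from the archive and "tweet_json.txt" as the path, then pass the result of read_tweets("tweet_json.txt") to to_dataframe. Returning the failed IDs makes it easy to see which tweets were no longer available when queried.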