I have had a comprehensive introduction to scientific computing, Python, and the related tools used by data scientists. I can use Python to read, clean, process, and analyze real-world data while following good programming practices: using functions, choosing appropriate data structures, and writing readable, maintainable code. I can also apply statistical analysis and hypothesis testing to determine whether the results of my analysis are statistically significant.
Skills:
• Data structures, algorithms, classes
• Data formats
• Multi-dimensional arrays and vectorization in NumPy
• DataFrame, Series, data ingestion and transformation with pandas
• Data aggregation in pandas
• SQL and Object-Relational Mapping
• Data munging
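
As a minimal, illustrative sketch of the read, clean, analyze, and test workflow described above: the sample data, the function names (`load_scores`, `permutation_p_value`), and the choice of a permutation test are all assumptions made for this example, not taken from any specific project. It uses only the Python standard library so that it runs without dependencies; in practice the same steps would typically be done with pandas and NumPy.

```python
import csv
import io
import random
import statistics

# Hypothetical raw data: inconsistent casing, stray whitespace, one missing score.
RAW_CSV = """group,score
A, 12.0
a,14.5
B,9.0
b ,
B,10.5
A,13.5
B,8.5
A,15.0
"""


def load_scores(text):
    """Parse CSV text into {group: [scores]}, skipping rows with no score."""
    groups = {}
    for row in csv.DictReader(io.StringIO(text)):
        label = row["group"].strip().upper()  # normalize "a", "b " -> "A", "B"
        value = row["score"].strip()
        if not value:  # drop rows where the score is missing
            continue
        groups.setdefault(label, []).append(float(value))
    return groups


def permutation_p_value(a, b, n_resamples=5000, seed=0):
    """Estimate a two-sided p-value for the difference in means of a and b."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    observed = abs(statistics.mean(a) - statistics.mean(b))
    pooled = a + b  # new list, so the inputs are not modified
    hits = 0
    for _ in range(n_resamples):
        rng.shuffle(pooled)
        left, right = pooled[:len(a)], pooled[len(a):]
        if abs(statistics.mean(left) - statistics.mean(right)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_resamples + 1)


groups = load_scores(RAW_CSV)
p_value = permutation_p_value(groups["A"], groups["B"])
print(f"mean A = {statistics.mean(groups['A']):.2f}, "
      f"mean B = {statistics.mean(groups['B']):.2f}, p = {p_value:.3f}")
```

A small p-value here would suggest that the difference between the two group means is unlikely to be due to chance alone, although with a sample this small the result is only illustrative.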