MIST 7770



Business Analytics

Tableau / BI Final Project – Fall 2020

For the project, I will ask you to analyze one of the following: i) the “Global Superstore” dataset (download from the course website); ii) the Coffee Chain Exercise (download from the course website); iii) the “NFL Combine/Draft” dataset (download from the course website); iv) any of the data sets outlined below (there are many possible categories to choose from); or v) any dataset of your choice. Please note that the Global Superstore dataset has three worksheets (tables) that can be connected. The Orders and Returns worksheets can be linked by their common key “Order ID”. The People table (which may or may not be useful) can be linked to the other two via “Region” as the common key. The NFL Combine/Draft dataset has two worksheets, CombineResults and NFLdraft, that can be linked via their common key “Player”. The Coffee Chain dataset has just one worksheet/table (which is completely fine). If you choose your own dataset, it does not need to have multiple worksheets; just make sure the data set is rich enough to support a thorough analysis.
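To make the idea of linking worksheets by a common key concrete, here is a minimal sketch in plain Python. It is not how Tableau performs the join (in Tableau you connect the tables on the data source page); it only illustrates what linking Orders to Returns on “Order ID” and to People on “Region” produces. The key names come from the description above. Every other column name and value (`Profit`, `Returned`, `Person`, the sample rows) and the helper `link_superstore` are assumptions made up for illustration.

```python
# Tiny stand-ins for the three Global Superstore worksheets.
# Only "Order ID" and "Region" come from the assignment text; all other
# column names and values are hypothetical.
orders = [
    {"Order ID": "A-1", "Region": "East", "Profit": 120.0},
    {"Order ID": "A-2", "Region": "West", "Profit": -35.5},
    {"Order ID": "A-3", "Region": "East", "Profit": 60.0},
]
returns = [{"Order ID": "A-2", "Returned": "Yes"}]
people = [
    {"Region": "East", "Person": "Ann"},
    {"Region": "West", "Person": "Bob"},
]


def link_superstore(orders, returns, people):
    """Left-join Returns on "Order ID" and People on "Region"."""
    returned_ids = {row["Order ID"] for row in returns}
    person_by_region = {row["Region"]: row["Person"] for row in people}
    linked = []
    for order in orders:
        row = dict(order)  # copy so the original worksheet rows are untouched
        # Orders with no matching row in Returns were not returned.
        row["Returned"] = "Yes" if order["Order ID"] in returned_ids else "No"
        # A region with no match in People yields None.
        row["Person"] = person_by_region.get(order["Region"])
        linked.append(row)
    return linked


linked = link_superstore(orders, returns, people)
for row in linked:
    print(row)
```

Every order is kept (a left join), so orders that were never returned still appear in the linked table and can be compared against the returned ones.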

The requirement for the project is to push far into the data in your role as an analyst. The initial in-class assignments are designed to introduce you to Tableau and business intelligence (BI) / dashboard tools. For this project, the goal is to push that level of analysis much further by probing into the data set you choose to see what business intelligence you can uncover. The videos outlined on the course website will also provide insight. I leave much of this project up to your insight and creativity (i.e., you are the analytics consultant). For instance, for the Global Superstore or Coffee Chain data sets (or any similar data sets based on retail/consumer products), the problems you explore may include: Which products/sectors are underperforming? Which variables drive performance levels? Are there issues with certain product lines, products, markets, pricing structures (margins), costs, etc.?

There are multiple avenues that can be undertaken; the questions you probe depend somewhat on the phenomenon captured by the data set you choose. This project should be treated like a typical case study where you’ve been given data and need to examine what problems exist. Grading will be based on the sophistication of your analysis and the overall level of insight in the findings you articulate. For this project, it is useful to apply the knowledge you have gained from your education or professional experience.

Deliverable

Due Date: Monday Class (Nov 23 midnight – 11:59pm); Wednesday Class (Nov 25 midnight – 11:59pm)

Submit via email to d.preston@tcu.edu (I will confirm receipt) or via D2L.

A. Visualizations

A total of 8 visualizations, each with a brief description of your findings (approx. 2-3 sentences, just to aid understanding of the image). Each visualization can consist of a worksheet or a dashboard (i.e., a combination of worksheet images). Copy and paste them into a Word document as part of your report; they can be included in an Appendix. Please note that the rigor of your findings is judged on the collective whole of the 8 images you produce. You do not need to submit the Tableau file itself, just a Word document with your strategic analysis followed by the 8 visualizations copied into the Appendix. You can copy your images directly from Tableau: go to the “Worksheet” tab at the top, then “copy”, then “image”, then paste into Word. You do not need to show filters, the pill section (e.g., rows, columns), etc. The quick 2-3 sentences will shed light on what you did. If you do wish to show everything, you can hit “print screen” on your keyboard and paste into Word.

B. Written Strategic Analysis

The analysis should explore problems such as the following (using Superstore as the example): Which products/sectors are underperforming? What correlates with profit or other performance measures? Are there issues with certain product lines, products, markets, pricing structures (margins), costs, etc.? The write-up should present a strategic analysis of the findings and provide detailed tactical and strategic recommendations based on them, including recommendations regarding future practices. Treat the executive management team of the organization (relevant to your data set) as the recipient of your analysis. A key skill required is to take the analytical findings and articulate them to a top management audience. This will require critical writing skills so that highly technical details are conveyed succinctly to an executive audience. The quality of the write-up will be judged on the sophistication of the analysis and the overall level of insight in the findings you articulate. The length of this analysis should be approximately 1.5-2 pages single spaced (2 pages max).

Data Sets

The project should focus on a singular data set for analysis. You can choose the “Global Superstore”, “Coffee Chain”, or “NFL Combine Data” that can be accessed on the course website or another dataset of your choice.

Tableau Public Resources website.

Best to sign up for a free Tableau Account to have full access



Data sets include the following categories: sports, public data, education, government, science, lifestyle, technology, health, entertainment and business. Each student is welcome to select a data set they are interested in aside from the data sets listed. However, data sets currently being analyzed for another class are not an option (conflict of interest). Please run by me any questions you have regarding the viability of a particular data set. Make sure the data set has enough records and variables to support a comprehensive analysis (some of the data sets listed below are rather meager while others are very robust). Please note that multiple students can analyze the same data set (this is not a one-to-one assignment).
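One quick way to judge whether a candidate data set has enough records and variables is to count them before loading the file into Tableau. The sketch below uses Python's standard `csv` module and assumes the data set is a CSV file with a header row. The in-memory sample, its column names other than “Player”, and the helper `profile` are hypothetical stand-ins; in practice you would pass an opened file such as `open("your_file.csv", newline="")` (file name assumed).

```python
import csv
import io

# Hypothetical sample standing in for a downloaded CSV file.
sample_csv = io.StringIO(
    "Player,Position,Forty\n"
    "A. Smith,QB,4.8\n"
    "B. Jones,WR,\n"
    "C. Lee,RB,4.5\n"
)


def profile(file_obj):
    """Count records, variables, and blank cells per column."""
    reader = csv.DictReader(file_obj)
    columns = reader.fieldnames or []
    blanks = {name: 0 for name in columns}
    records = 0
    for row in reader:
        records += 1
        for name in columns:
            # Treat missing or whitespace-only cells as blank.
            if not (row[name] or "").strip():
                blanks[name] += 1
    return {"records": records, "variables": len(columns), "blanks": blanks}


summary = profile(sample_csv)
print(summary)
```

A data set with only a handful of records, very few variables, or mostly blank columns is likely too meager for eight distinct visualizations.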

Sports:

Sports Analytics

Some potentially interesting data sets looking at sports analytics:

(I pulled the “NFL Combine” data set from here, but I also cleaned this data up; the Excel spreadsheet posted on the course website is the “cleaned” version.)

English Premier League

This site has a comprehensive amount of data from the English Premier League



FIFA World Cup Match Results: Matchups and results of FIFA World Cup matches from 1930 - 2014. Source: data.world Dataset (xlsx)

2018 FIFA World Cup Rosters: Goals, caps, club, and date of birth for players on 2018 FIFA World Cup rosters. Source: data.world Dataset (xlsx)

FIFA 18 Player Ratings: 17k+ players, 70+ attributes extracted from FIFA 18. Source:

Dataset (csv)

ATP Top-Ranked Singles Tennis Players: Association of Tennis Professionals' (ATP) Men's and Women's top-ranked players from 1973-2018. Dataset (csv)

Wimbledon Champions: Men's and Women's championship matchups from 1877-2016. Dataset (csv)

Tour de France Statistics: Winner, distance, speed, location, and more about Tour de France since 1903. Sources: Knoema, Wikipedia Dataset (csv)

Global Sport Finances: The top paying pro sports teams and the top paid athletes via ESPN. Dataset (xlsx)

Summer Olympics Medalist Dataset: Every summer Olympic medalist from 1896-2012. Criteria such as home country, event, medal, and gender are included in the data. Dataset (xlsx)

NFL stats, 1999-2013: Offensive statistics and personal information (height/weight/dob/combine info/college/conference) for NFL players from 1999-2013 via pro-football- Dataset (xlsx)

Public Data

Airbnb Listings in New York City: 30,478 Airbnb listings in New York City. This data was compiled from Inside Airbnb Dataset (xlsx)

FAA Wildlife Strikes: This is a cleaned table of wildlife strikes from 2000-2015 in the United States. Visit the FAA Wildlife Strike Database which contains records of wildlife strikes reported by airlines, airports, pilots, and other sources. Dataset (xlsx)

UK Big Lottery Fund since 2004: Basic grants data from the UK BIG Lottery Fund, for grants made from 2004 onwards, via .uk Dataset (xlsx)

University Advancement, Donations, and Giving: This is a table of donations made to Universities in the United States. The donation amounts and locations in this data set are not real as they are intended for training purposes only. Dataset (xls)

American University Data: The Integrated Postsecondary Education Data System (IPEDS) is the primary source for data on colleges, universities, and technical and vocational postsecondary institutions in the United States via the National Center for Education Statistics. Dataset (xlsx)

|Science |
|CO2 Emissions by London Borough (2005-2014) |Estimates of total CO2 emissions by London Borough, as well as emissions per capita of population (spatial file for Boroughs and excel of emissions), via data..uk and data..uk (spatial). |Dataset (zip) |
|Significant Volcanic Eruptions |A global listing of over 600 volcanic eruptions from 4360 BC to the present via Significant Volcanic Eruptions Database. A significant eruption is classified as one that meets at least one of the following criteria: caused fatalities, caused moderate damage (approximately $1 million or more), Volcanic Explosivity Index (VEI) of 6 or greater, generated a tsunami, or was associated with a significant earthquake. |Dataset (xlsx) |
|Global Active Archive of Large Flood Events |Global flood events from 1985 to present via Dartmouth Flood Observatory. |Dataset (xlsx) |
|Magnitude 6+ Earthquakes |All recorded earthquakes with a magnitude of 6 or greater from 1900 - 2013 via USGS (United States Geological Survey) |Dataset (xlsx) |

|Lifestyle |
|Star Wars Character Details |Details of Star Wars characters including weights, hair colour and birth planets via Github. |Dataset (json) |
|Titanic Passenger List |All the known passengers of the Titanic, where they were heading, what cabin they stayed in, and if they survived or not. |Dataset (csv) |
|Top Baby Names in the US |The most popular male and female names in each state for each year from 1910-2012 via the Social Security Administration. |Dataset (csv) |
|Cat vs Dog Popularity in the US |Population and ownership by household of dogs and cats broken down by state via American Veterinary Medical Association. |Dataset (xlsx) |

|Technology |
|Startup Venture Funding |Information about startup companies, investment, and acquisitions via Crunchbase. Use the Companies, Rounds, Investments, and Acquisition sheets. |Dataset (xlsx) |
|Mobile OS Usage |What percentage of the market each mobile operating system had from 2008-2014 via StatCounter. |Dataset (csv) |

|Health |
|Tuberculosis Burden by Country |The World Health Organization estimates the prevalence and mortality of Tuberculosis by country. |Dataset (csv) |
|US County Health Rankings |Ranks US counties on a variety of health factors via Robert Wood Johnson Foundation. |Dataset (csv) |
|Global Burden of Disease |Estimates the burden of diseases, injuries, and risk factors globally and for 21 regions for 1990 and 2010 via IHME (Institute for Health Metrics and Evaluation). |Dataset (csv) |

|Entertainment |
|Eurovision 1998 to 2010 |All the Eurovision entries from 1998 until 2012, including the results of the finals, via the Eurovision Song Contest. |Dataset (xlsx) |
|Hollywood's Most Profitable Stories |Title, genre, studio, profitability and ratings for movies released 2007-2012 courtesy of . |Dataset (csv) |
|Pokemon Index |Contains attack, defense, speed, and HP stats for all Pokemon numbered 1-718 and their special forms. Via . |Dataset (xlsx) |

|Business |
|The 2014 Inc. 5000 |The Inc. 5000 is Inc. Magazine’s annual list of the 5000 fastest growing private companies in the United States. The list is compiled by measuring each company's percentage revenue growth over a four-year period. |Dataset (csv) |
|Employment Changes in Great Britain by Industry |Employment data by industry for 2011 and 2014 by city for Great Britain, courtesy of EMSI, Economic Modeling Specialists Inc. The 1-digit sheet has data aggregated at the industry level whereas the 2-digit sheet has it aggregated at the sub-industry level. |Dataset (xlsx) |
|Millennial vs Baby Boomer Employment |Employment data in the United States for the millennial and baby boomer generations, broken up by state, MSA, and industry for 2009-2013. Courtesy of EMSI, Economic Modeling Specialists Inc. |Dataset (xls) |

Additional Data Sources

Explore other data sets which are publicly available. Don't forget to check that the data is well-structured!

• data.world

• Data is Plural

• UN Data



• Kaggle

• NOAA

• Reddit

• World Fact Book

• UN Environmental Data Explorer

• World Health Organization

• Pitney Bowes

Web Data Connectors

Directly connect to your own data sources that you already use and love! Right-click, copy and paste the link below into Tableau's Web Data Connector to start.

• Fitbit

• Runkeeper

• Blockspring

• import.io

• USGS Earthquake Data

• Facebook

• Currency Exchange Rates

• UK Street Crime

• Tumblr

• Twitter

• Read about data.world's WDC
