This project involves taking AMERICAN football depth charts and player data and putting them into a format that can be worked with programmatically.
There are 2 types of data that may be available for each team (in various files in the 'work' folder). The names need to be checked against a list (to make sure their spellings match exactly), and they need to be in a certain format that is outlined in the detailed requirements and the attachment.
## Deliverables
###################################################
Please see the attachment for the files you will be working with. The data you will be cleaning up is in the 'work' folder; the 'helper' folder has the data you need to check against for spelling, first names, etc.
###################################################################################
[Starter data charts]
The X axis is the team played, and the Y axis is the player. You can look at these charts and determine which games each player started (anything with START in the column is a start; "..." and "XXX" can be ignored).
I need this data in the following format:
Season, TeamIdentifier, OpponentIdentifier, Player, Position
An example:
2007, MIC, MSU, Charles Woodson, CB
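A minimal Python sketch of the chart-to-rows conversion described above. The in-memory layout (`opponents`, `grid`, `positions`) is a hypothetical stand-in, since the actual files in the 'work' folder may be laid out differently, and the opponent identifier `OSU` and the second player row are made up for illustration only.

```python
def starts_from_grid(season, team_id, opponents, grid, positions):
    """Return one (season, team, opponent, player, position) row per START cell.

    opponents: opponent identifiers in column order (the X axis).
    grid:      player name -> list of cell values, one per opponent (the Y axis).
    positions: player name -> position (assumed to be known from the chart).
    """
    rows = []
    for player, cells in grid.items():
        for opponent, cell in zip(opponents, cells):
            # Anything containing START counts as a start; "..." and "XXX" are skipped.
            if "START" in cell.upper():
                rows.append((season, team_id, opponent, player, positions[player]))
    return rows


# Hypothetical sample data, not taken from the real files.
opponents = ["MSU", "OSU"]
grid = {
    "Charles Woodson": ["START", "XXX"],
    "Chad Henne": ["...", "START"],
}
positions = {"Charles Woodson": "CB", "Chad Henne": "QB"}

rows = starts_from_grid(2007, "MIC", opponents, grid, positions)
for row in rows:
    print(", ".join(str(value) for value in row))
```

The names and identifiers passed in here are assumed to be already cleaned up according to the details below.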
Here are some important details:
1. The name must be in First Last format. If the file lists the name as Last, First, it needs to be changed. There should be no commas in any of the names. If there is no first name, look in the "rosters" file, find the player, and determine their first name. All names should be checked against this file so that they match the spelling/punctuation of the "rosters" file.
2. The team and opponent identifiers are not the same as they are in the files. You need to find each team in the "Team Names and IDs" file and use the Team Identifier that matches.
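The two details above could be handled roughly as follows. This is only a sketch under stated assumptions: `roster` is a hypothetical list of "First Last" names already loaded from the "rosters" file for one team and season, `team_ids` is a hypothetical mapping loaded from the "Team Names and IDs" file, and the team names used as keys are guesses for illustration. Names that cannot be matched unambiguously are returned as `None` so they can be reviewed by hand.

```python
def to_first_last(raw):
    """Turn 'Last, First' into 'First Last'; leave names without a comma unchanged."""
    raw = raw.strip()
    if "," in raw:
        last, first = (part.strip() for part in raw.split(",", 1))
        return f"{first} {last}".strip()
    return raw


def match_roster(name, roster):
    """Return the roster's spelling of a name, or None if it needs manual review."""
    key = name.casefold()
    exact = [entry for entry in roster if entry.casefold() == key]
    if exact:
        return exact[0]
    # No first name given: accept a last-name match only if it is unambiguous.
    by_last = [entry for entry in roster if entry.split()[-1].casefold() == key]
    return by_last[0] if len(by_last) == 1 else None


def team_identifier(name_in_file, team_ids):
    """Look up the Team Identifier for a team name as written in a work file."""
    return team_ids.get(name_in_file.strip().casefold())


# Hypothetical sample data, not taken from the real helper files.
roster = ["Charles Woodson", "Chad Henne", "Nick Sheridan"]
team_ids = {"michigan": "MIC", "michigan state": "MSU"}

print(match_roster(to_first_last("Woodson, Charles"), roster))
print(match_roster(to_first_last("Henne"), roster))
print(team_identifier("Michigan State", team_ids))
```

Matching is case-insensitive here, but the value written out is always the roster's own spelling, so punctuation and capitalization follow the "rosters" file.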
##################################################################################
[Depth Charts data]
These will be a list of players for each team/year by position, along with their backups. I need these names to be in First Last format; if there is no first name, locate the player in the "rosters" file and determine their first name. All names should be checked against this file so that they match the spelling/punctuation of the "rosters" file. The format for this data should be:
Season, TeamIdentifier, Player, String, Position
An example:
2007, MIC, Charles Woodson, 1, CB
2007, MIC, "previous player's backup name", 2, CB
2007, MIC, "previous player's backup name", 3, CB
2007, MIC, Chad Henne, 1, QB
2007, MIC, Nick Sheridan, 2, QB
The team identifier is not the same as it is in the files. You need to find the team in the "Team Names and IDs" file and use the Team Identifier that matches.
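A minimal Python sketch of producing the depth chart rows, assuming the chart has already been read into a hypothetical `chart` mapping of position to players ordered from starter to last backup (the real files may need extra parsing to get there), and that names and the team identifier have already been cleaned up as described.

```python
def depth_chart_rows(season, team_id, chart):
    """Return (season, team, player, string, position) rows.

    chart: position -> list of players, starter first, then backups in order.
    The String value is the player's 1-based place in that list.
    """
    rows = []
    for position, players in chart.items():
        for string, player in enumerate(players, start=1):
            rows.append((season, team_id, player, string, position))
    return rows


# Sample data reusing the QB rows from the example above.
chart = {"QB": ["Chad Henne", "Nick Sheridan"]}

depth_rows = depth_chart_rows(2007, "MIC", chart)
for row in depth_rows:
    print(", ".join(str(value) for value in row))
```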
##################################################################################