October 20, 2008

Plug for the SABR Minor League Encyclopedia

Steve Treder penned a complimentary post about the SABR Minor League Encyclopedia in Hardball Times today.


October 7, 2008

SABR Minor League Encyclopedia Update

The SABR Minor League Encyclopedia was updated on Oct. 6. The next scheduled update will be in January. The Encyclopedia may be searched at: www.minors.sabrwebs.com


September 4, 2008

SABR Minor League Encyclopedia Update

The next update for the SABR Minor League Encyclopedia will be on October 1. Among the additions will be statistics and data for the 2008 season.

I will do my best to have the committee newsletter published this weekend. I know I've promised it since July, but I've been very busy the past two months. Thank you for your patience.


March 18, 2008

Volunteers Needed for Minor League Ballpark Data

Ted Turocy, technical coordinator for the SABR Minor League Encyclopedia, is seeking volunteers to work on minor league ballpark data for the Encyclopedia. He writes:

"Gord Brown has generously donated his extensive research (joint with Ted Lukacs and others) on minor league ballparks to the project. I'd like to get going on integrating this into the database. To this end, I could use a couple volunteers to take his original Word documents and produce a spreadsheet version of them.


It's pretty straightforward work, not terribly exciting, but requires a little bit of comfort with using a spreadsheet (since using the "fill handle" is very helpful for this task).

If you're interested in helping out, please contact me here."


March 11, 2008

Updates to SABR Minor League Encyclopedia (March 10)

The SABR Minor League Encyclopedia was updated on March 10.

Ted Turocy, technical coordinator for the Encyclopedia, shares what's been added:

* We've built out 19th century rosters for players with last names starting in "Y" and "Z", based on Reed Howard's lists, and through Cliff Blau's efforts.

* Ed and I (well, Ed gets most of the credit here) have been able to develop a process to synchronize changes from his database to the main one. We tested this out last week with players with last name "A," so we've added a bunch of updates for those players.

* I'm now in receipt of all Paul Porter's statistics spreadsheets. I have added additional statistical columns based on these to the 1903 AA and EL, by way of testing things out.

* Between Paul's spreadsheets, data entry I had already done a few years ago, and data entry by a few volunteers, we will have expanded statistical coverage of most of the AAA and AA leagues in 1981, as well as whatever other leagues from the 1970s I manage to get done today.

* Further work on identifying managers from Jerry Jackson's data with players in the database. We've been able to match up about 40% of the managers so far. The rest will go much more slowly, because Jerry's data lacks birth and death dates for most of these guys -- so we either need to look for player-managers (of which we are finding a lot), or use other resources to match up former players with their managerial careers.


February 26, 2008

Call for Volunteers (Revised) for Paul Porter Data

Ted Turocy, technical coordinator for the SABR Minor League Encyclopedia, has posted a request for volunteers to work on data contributed by Paul Porter:

Hi again all,

I'd like to put out a "revised" call for volunteers to help move the process of getting Paul's spreadsheets incorporated into the database.

I have been working on some tools to automate much of the ID mapping process I was originally looking for. It turns out some pretty simple tests can identify around 85-90% of the players in a typical spreadsheet. Most of the rest are spelling variations, or other ambiguities (two players with the same name on the same club, etc.)

So, I'd like to break up the process into two parts. The first thing that we need to do is to restructure the team and league columns in the spreadsheets. I've just uploaded a file called "1903-stdclubs.xls" which illustrates the process using 1903.

What has been done:

(1) Create three columns for team names, Team1, Team2, Team3. For multi-club players, enter the teams he appeared for one per column. (There are very rare instances of 4-club players. If this happens, just leave the original club entry in Team1. I will process these specially, there are so few.)

(2) Make sure the team names match the names in the database. Some common divergences include things like "St Paul" (we use "St." with a period), "Ft Worth" (we always spell out "Fort"). Check the website if you're not sure.

(3) Expand the league abbreviation out to the full league name as we have it in the database.
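As an illustration only (this is not Ted's actual tooling), the three steps above could be sketched in Python roughly as follows. The "/" separator for multi-club entries, the function names, and both lookup tables are assumptions made for the example; the real spreadsheets and database names may differ.

```python
# Hypothetical sketch of steps (1)-(3). The "/" separator, the function
# names, and the contents of both lookup tables are assumptions.

# Step (2): spelling fixes so club names match the database.
TEAM_PREFIX_FIXES = {"St ": "St. ", "Ft ": "Fort "}

# Step (3): league abbreviation -> full league name (illustrative entries).
LEAGUE_NAMES = {"AA": "American Association", "EL": "Eastern League"}


def standardize_team(name):
    """Apply the prefix fixes to one club name."""
    name = name.strip()
    for prefix, fixed in TEAM_PREFIX_FIXES.items():
        if name.startswith(prefix):
            return fixed + name[len(prefix):]
    return name


def restructure_row(club, league):
    """Split a club entry into Team1..Team3 and expand the league name."""
    clubs = [standardize_team(c) for c in club.split("/")]
    if len(clubs) > 3:
        # Rare 4-club player: leave the original entry in Team1
        # so it can be processed specially later.
        teams = [club, "", ""]
    else:
        teams = clubs + [""] * (3 - len(clubs))
    league = league.strip()
    return {
        "Team1": teams[0],
        "Team2": teams[1],
        "Team3": teams[2],
        "League": LEAGUE_NAMES.get(league, league),
    }
```

For example, `restructure_row("St Paul/Ft Worth", "AA")` puts "St. Paul" in Team1, "Fort Worth" in Team2, leaves Team3 empty, and expands the league to "American Association". An unrecognized league abbreviation is passed through unchanged so it can be checked by hand.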

You should feel free to sort the spreadsheet if it helps -- just make sure that if you do, your spreadsheet selects all the rows and columns. Some spreadsheets have column C blank, which would mess things up. (By the way, if this does happen, my programs will freak out later on -- so we will know something went wrong. Don't be afraid that a bad sort will rewrite history!)

I am going to be traveling the next few days. If you're interested in doing this, feel free to just post to the list which years you've taken. If you're any good with spreadsheets, you can probably manipulate one year in 15-30 minutes at most. Once you've finished them, send them to me off-list -- we will wind up using most of our 100Mb limit in the Files section pretty quickly otherwise.

Once this is done, I hit the spreadsheets against the database to map players. Then, the next task will be for a human to look at the unmatched players and determine why they failed to match. This should be a small fraction of the total players in the database -- much easier than having to do the whole shebang.
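The email does not say which tests the mapping tools actually use. Purely as a sketch, one simple test would be an exact match on player name and club, with anything that finds zero or several candidates set aside for a human to review. The field names (`name`, `team`, `id`) and the function name are hypothetical.

```python
from collections import defaultdict

# Hypothetical sketch: exact match on (name, team). The field names and
# the matching rule are assumptions, not the project's actual tests.


def match_players(sheet_rows, db_players):
    """Return (matched, unmatched) for spreadsheet rows against the database."""
    index = defaultdict(list)
    for player in db_players:
        key = (player["name"].lower(), player["team"].lower())
        index[key].append(player["id"])

    matched = {}
    unmatched = []
    for row in sheet_rows:
        key = (row["name"].lower(), row["team"].lower())
        ids = index.get(key, [])
        if len(ids) == 1:
            matched[(row["name"], row["team"])] = ids[0]
        else:
            # Zero hits usually means a spelling variation; several hits
            # means two players with the same name on the same club.
            reason = "ambiguous" if ids else "not found"
            unmatched.append((row["name"], row["team"], reason))
    return matched, unmatched
```

The `unmatched` list, with its reason for each failure, is what a volunteer would then work through by hand.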

Ted Turocy
Technical Coordinator, SABR Minor League Encyclopedia


February 25, 2008

SABR Minor League Encyclopedia

The SABR Minor League Committee is pleased to announce the launch of the alpha version of the online SABR Minor League Encyclopedia. It can be found at: http://minors.sabrwebs.com


The SABR Minor League Encyclopedia project is led by Kevin McCann (Committee Chair), Kevin Johnson (Quality Control Coordinator), and Ted Turocy (Technical Coordinator). The foundation for the database is the donated work of Ed Washuta. Others who have contributed to its launch include Mike Emeigh, Frank Hamilton, Reed Howard, Rod Nelson, Paul Rivard, and Tom Ruane.


The database features individual batting, pitching, and fielding statistics (complete and incomplete) for affiliated and independent leagues between 1901 and 2007, plus league standings and statistical totals. Statistics and data for 19th century leagues donated by Reed Howard will be added soon.


Later this year, the database will be integrated into version 2.0 of the online SABR Baseball Encyclopedia, allowing researchers to view both major and minor league records for individual players. Because it is an alpha version, there are errors and gaps in player records. However, researchers can use this version to determine what information is already available, and what information needs to be supplemented or added.


Volunteers to enter data, identify and correct errors, and contribute biographical information are needed to help move the project forward. SABR has many researchers who are experts for specific teams, leagues, regions, and eras. Some have compiled statistics using box scores for leagues that were not included in the Guides. Donations of such research will help make the database more complete and useful for the SABR community.


If you would like to volunteer, please contact Kevin McCann or Frank Hamilton. Volunteers can also join the SABRminorleaguesDATA eGroup. To subscribe, send an e-mail to: SABRminorleaguesDATA-subscribe@yahoogroups.com
