Submit a Story!
Get the BallHype iPhone App
Season similarity scores
Season similarity scores
Ichiro Suzuki and Al Wingo? Order the Hardball Times Annual 2009 today !
4 Comments
  • sdanne sdanne
    +1

    Nice job, although I didn't understand the reasoning behind squaring the linear weights values of each event. And I'm not sure how GIDP or CS are positive relative to a single, either.

     

    You should read this article from almost a year ago...

    http://statspeak.net/2007/11/stats-204-the-proximity-matrix-or-re-visioning-similarity-scores.html

    There's a "Zach" commenting on that thread, not sure if it's you or if it's just a coincidence. Two different ways to attack the same problem. 

    Posted 11/21/2008 respond (flag)
  • zwalters zwalters
    +1
    I'm the Zach in the comments from that previous article, as you suspect.  It took me a while to follow up because my first SQL script was terribly inefficient.
    Posted 11/21/2008 respond (flag)
  • jcdorhauer jcdorhauer
    +1
    Good stuff. It is articles like this that keep me logging on daily to THT - simply the best at analysis, history, and context. I would love to see the comps on Bob Gibson's 1968 season: 13 shutouts and a 1.12 ERA. I note that all of your comps are for offensive stats - which begs the questions about a parallel article for pitching comps.
    Posted 11/22/2008 respond (flag)
  • ger8ry ger8ry
    +1
    The trouble with Arlie Latham & Hugh Nicol is that stolen base didn't mean then what it does today. Those guys could be credited with a steal for going from 1st to 3rd on a single.
    Posted 11/23/2008 respond (flag)
Blog Reactions

Run-based similarity scores
THE BOOK--Playing The Percentages In BaseballGreat work… to which I disagree.  Pizza Cutter did similar work based on rate stats, to which I have lots of comments on his thread.  My key point is this: If you are interested in looking for similar players to Vince Coleman, you may insist that the speed components (3b per 2b+3b and sb per sbOpp) be weighted much more than you otherwise would, because you are really interested in the speed players mostly. So, in a run-based system, the speed components simply won’t have much ...

Related Content
Whoops!
lookoutlanding.com 11/17/2008 — Geoff Baker, September : When it came to Ichiro, who got off to a typically slow start in April and part of May, the internal turmoil nearly hit its boiling point. "I just can't believe the number of guys who really dislike him," said one ...
Boorstein: Cano Has To Draw More Walks
slidingintohome.blogspot.com 11/22/2008 — Cano's rough 2008 has been talked about a lot, and it's left many fans wondering whether Cano will be as good as we all thought he was going to be. And it's left others wanting the Yankees to ship him out of town. I wouldn't worry so much. Yes, it was a down year, but it's no reason to give up ...
Seattle Mariners Are The Next Baseball PowerhouseBleacher Report - MLB 11/25/2008
Yes, I said it. The Seattle Mariners are the next baseball powerhouse. In 2008 the Mariners were plagued with injuries. In 2009 the Mariners can win the West. In 2007 the Mariners were in contention for the playoffs, fighting with the A's, ...