The Biggest UZR: Pilot Episode
Defense...
The final frontier...
These are the voyages of the fielding metric UZR...
Its continuing mission - to seek out new skill and new abominations...
Boldly going... where no metric has gone before.
2009 is a remarkable year. For the first time, the system which many consider the best method for evaluating defense is freely available to the public. The metric is called Ultimate Zone Rating, and you can find it on Fangraphs. The accessibility of UZR allows us to pierce the veil of one of baseball's greatest mysteries: defense.
Defense is probably the hardest component of the game for fans to judge. There are several reasons for this. First of all, observation - watching the game - is mostly useless. There is simply too much information for a viewer to process and remember to evaluate defense objectively. Furthermore, it's human nature to emotionalize memory. A disastrous throw by Julio Lugo outweighs all the grounders he gets to with his speed; a gliding play by Jeter lingers in one's mind longer than all the easy liners that pass by him.
Secondly the best statistics to look at defense are not well known or widely available. Most fans get their baseball information from television broadcasts, where they only hear about error totals and fielding percentage, when they hear defensive stats at all. Those measures tell them very little about a player's true defensive value. Great fielders not only execute catches and throws, they also get to more balls than their peers and turn well-hit balls into outs.
Thirdly, fans tend to view defensive skill as a mostly static quality. Many of us, myself included, form judgments about players' defense, and then assume it to be the same every year. Back in 2008, I thought: "Coco Crisp has great speed and makes phenomenal catches, and his defensive ratings were great last year, so he must be a good defensive player this year, too." Unfortunately, players' defense can vary from year to year as much as their offense. For example, Crisp was amazing in 2007, but his 2008 was a real step back. It's much easier to see this with UZR: in 07, Crisp had a career-high UZR of 24.4 at center field, meaning he saved 24.4 runs above an average defender; in 2008, his UZR fell to -8.6.
Finally, for those people who want to study fielding, there's the hurdle of finding, understanding and evaluating the advanced systems. You get a veritable alphabet soup of systems and stats: Range Factor, ZR, RZR, RAR, and UZR. And many of the best stats, including, until recently, UZR, are proprietary information unavailable to fans. Worst of all, the stats can sometimes paint dramatically different pictures of players, meaning you have to either reconcile or ignore certain stats. You don't have to be a math nerd to study sabermetrics, but it probably helps.
The purpose of this column is to track Red Sox defense as the season goes along, using UZR. Why UZR? Well, it's regarded as one of the best fielding metrics, and it is freely available to the public. Why track it? Well, it gives an excuse to talk about defense in general, and it gives us a chance to see how player's performances affect overall numbers.
So enough talk, on to the numbers:
Overall, the Sox are the 7th worst team in MLB by UZR, at -16.5. This is not as bad as it sounds, because the team's been climbing in the ratings, thanks to better defense. IIRC, they were 2nd worst not too long ago. Let's look at the individual players with a minimum of 15 games at a position:
The Good:
Dustin Pedroia 2B 5.0
J D Drew RF 5.0
Nick Green SS 3.3
YOUUUUUK 1B 3.1
Mark Kotsay 1B 1.5
Four of eight positions have above-average defenders. Pedroia and Drew are the cream of the crop, with Green and Youk not far behind. Green's come a long way since the beginning of the season, where his UZR was negative and he committed 9 errors. To the casual observer, he's played better of late, showing great range, making strong throws to first and committing fewer errors. Unfortunately, our best defenders don't make up for the worst ones.
The Bad:
Youkilis 3B -0.5
Rocco Baldelli RF -3.9
Jacoby Ellsbury CF -6.4
Julio Lugo SS -6.8
Jason Bay LF -7.8
Mike Lowell 3B -8.0
The big surprise here is Ellsbury, who I think most of us would expect to be superb in center. He's agile and can make great catches. Lowell's aging and dealing with injury, Bay has never been a good defensive left fielder, and the less said about Lugo, the better.
The Biggest UZR:
Julio Lugo SS at -6.8 (-38.6 per 150 games, worst among regular starters, including Bay and Lowell)
37 comments
|
0 recs |
Do you like this story?
Comments
For comparison:
The Sox are still 2nd worst in the league by BP’s Defensive Efficiency Rate.
"Ninety percent [of my salary] I'll spend on good times, women, and Irish whiskey. The other ten percent I'll probably waste."
-Tug McGraw
Saw that
Link.
Also, Prospectus’ Defensive Efficiency has the Yankees atop the league and #3 in all of MLB. But the team’s UZR is negative at -9.6. Go figure.
However you slice it, Boston’s d has been really bad this year. If defense were a less opaque subject, we’d be clamoring for trades that improve it (Jack Wilson, etc.).
"It's just a tiny little nick, but it hurts when I get champagne in there."
- Jason Bay, on getting spiked scoring the winning run in ALDS Game Four.
In related news, UZR may quickly become outdated:
http://www.beyondtheboxscore.com/2009/7/10/944438/nytimes-the-next-wave-of
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Hix posted a link to in the FanShots too.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
With all due respect...
…this, and plenty of other discussions of this kind on OTM seem kinda ridiculous. Do you all actually think about numbers and crap when watching the Sox? I mean, honestly, how fun can that be. Personally, my views on defense go like this: If someone fucks up or doesn’t hustle out for a play, I’m gonna be pissed and curse the little men on my TV. If someone makes a great play, I’ll probably pump my fist and say “Fuck Yeah!” – or something like that.
But come on guys (and gals?). I know baseball is the “thinking man’s game,” but isn’t it better to just enjoy it as you see it, instead of always referring to DASTNGASA (that means "Dumb Ass Stats That Nobody Gives A Shit About).
And remember I said with all due respect…
"Well let me just quote the late-great Colonel Sanders, who said...'I'm too drunk to taste this chicken.' "
-Ricky Bobby
For the record
I do care about some numbers, but I only go as far as OPS and WHIP…none of that other ridiculous Bill James garbage.
"Well let me just quote the late-great Colonel Sanders, who said...'I'm too drunk to taste this chicken.' "
-Ricky Bobby
Personal issues.
That’s all I’m gonna say.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Actually
I am going to defend nepats (a bit). First, when I watch the games I don’t really think about UZR, etc. I simply wathc the way (a bad) scout would-eg try to see if I think someone is playing well or not. Who knows if the sabrstats are influencing these perceptions or not. Otherwise I just enjoy the game, especially if the Sox win.
Second, OPS, etc are rediculous Bill James stats, even if they are not sexy. Third, I think that defensive metrics are stone age stats. It is not that one should not try to gauge defensive performance via mean other than errors, but it is so damn complicated. I would like to see a study of how something like FIP+UZR(team) (and these things seperately) correlates with total runs allowed. Would be easy to do.
Not sure it would be anything like OPS vs runs(team).
It's been done, I'm sure.
Ask someone over at Beyond the Boxscore.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
actually
I don’t think it has (at least last time Kalkman was on this site we had a back and forth about it). It should be done, but I doubt it has. Many things should be done that have not-for example, do they break down UZR by home/road? If so, I am not aware of it. Why not? It would allow us to know how good of a metric it is and how variable it is to the quirks of the park (eg Fenway).
Hmm.
I think the fact that it just recently became freely available is holding people back. Maybe they don’t want to have it pulled if a flaw is found, putting us back in square 2 for defensive analysis (new technology, publicized today, aside). I don’t really know though.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Defending numbers guys:
During a game, I 100% agree with you. I generally think about nothing other than how much I want the next guy to get a hit (or not, depending). But, personally, I really enjoy reading and thinking about baseball when the games are not going on. Like many of the people who comment and post around here regularly, we enjoy analyzing the game as accurately as possible. This means that we welcome any stat that might help us do that. We could all just make unsubstantiated declarations about someone being a “sick fielder” because “he had this diving catch the other day that was awesome!” or we could use the information available to try to reach unbiased conclusions about a player’s ability.
And why do you like OPS and WHIP but not other ones? This is what I’ve never understood about people who are “anti-stats”: they do not shun all statistical analysis, but only the ones that might take a little more thought. Though your use of OPS is quite progressive for someone who refers to Bill James’ work as “garbage.”
So we do enjoy it as we see it, we just like to learn things that will hep us enjoy it even more.
"Ninety percent [of my salary] I'll spend on good times, women, and Irish whiskey. The other ten percent I'll probably waste."
-Tug McGraw
Really?
I think about stuff like UZR during games pretty often – well, when I can actually see the game, and not on Gameday. If a runner is being held and the ball is likely out of the fielder’s range, then I generally have the same reaction as you guys. It’s about knowing when to be angry and when to cheer, I guess.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Thanks
This is a sincere response BTW.
I really appreciate an honest answer to my question (regarding the numbers while watching) so I will reply with an honest answer to your questions.
1) OPS and WHIP I like one, because I use them in Fantasy Ball and two, they are far more easy for me, a known hater of math, to figure out. Also, most sports journalists tend to pay more attention to these stats (which as far as I can see, make perfect sense for seeing a players’ value).
2) I really don’t think Bill James is “garbage,” I was just ranting, which I tend to do.
I guess ultimately, I wrote my previous comment because it’s hot as hell and I was equally as bored. Thanks again for the reply.
Go Sox! (That’s what we’re all here for in the end).
"Well let me just quote the late-great Colonel Sanders, who said...'I'm too drunk to taste this chicken.' "
-Ricky Bobby
True.
It’s just that OPS and WHIP have issues. OPS, for one, values OBP and SLG the same, while OBP has been shown to coorelate with run scoring better. An improved version of OPS is wOBA, which is calculated so that you can judge it like OBP. An easy way to calculate it is (2*OBP+SLG)/3, or, to be more accurate, multiply OBP by 1.7 instead. That’s a stat that generally allows you to judge the difference between a batter like Papi and a batter like Ells.
WHIP is okay, but some pitchers will always allow more walks and hits than others (cough-WAKE-DICEK-cough). Plus, it counts hits that were more the fault of the park (Monster knocks) or bad fielders (not necessarily error prone guys, just slow guys) the same as hits that were allowed by the pitcher (homers, line drives, etc.). FIP (Fielding Independent Pitching) fixes that, to an extent, and tRA is better as well.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Ow
My head hurts now…
"Well let me just quote the late-great Colonel Sanders, who said...'I'm too drunk to taste this chicken.' "
-Ricky Bobby
Sorry.
Just giving you an option to learn more. You should, at the very least, be ready to Google a stat that we may mention if you’re not familiar with it – maybe just what good, bad, and average are considered for that stat. FanGraphs’ graphs are great for that.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Much appreciated
But that would be more work than is needed for me to enjoy the game…maybe I will check them out though, just to be on the same page.
"Well let me just quote the late-great Colonel Sanders, who said...'I'm too drunk to taste this chicken.' "
-Ricky Bobby
I kinda hate talking about wOBA using the shortcut that you mention, even if it is very close to accurate. The whole appeal to wOBA for me is the way that it is actually calculated; by weighing each event accurately as compared to other events. SLG does not make sense because it says that a double is twice as valuable as a single, a HR is twice as valuable as a double and so one, which is definitely not true. Doesn’t really matter; just my opinion.
"Ninety percent [of my salary] I'll spend on good times, women, and Irish whiskey. The other ten percent I'll probably waste."
-Tug McGraw
Yes, but it's hard to do those calculations without a calculator.
I’d call it the ballpark equation, both because it gives you a ballpark number, and it can be done AT the ballpark.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Not to hog the comments, but I've been thinking about this for a while...
If teams start following the example set by last year’s Rays (and others) by improving their defense, couldn’t that, in theory, completely wreck UZR numbers? In UZR, numbers are scaled where 0 = average, but it also is the replacement level, IIRC (if not, just tell me to shut up). If teams start valuing defense more, that means there will be less good defenders available, which would lower the replacement level. Better defensive players would also raise the average level, so UZR would have to start differentiating between the two.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
I do not think average is equivalent to replacement level. I’m not sure replacement level is defined in UZR. Generally there are probably “freely available players” i.e. replacement level players, that field better than many guys who record negative UZR’s but are not freely available (because of their offense).
But your main point is correct. UZR is relative to the talent at that position at that time. In that way it is automatically era-adjusted. But there is a fixed number of spots available, so there will always be the same number of defenders available. And if defense starts to be valued differently, there will probably be less available good defenders, but probably more good offensive players available.
"Ninety percent [of my salary] I'll spend on good times, women, and Irish whiskey. The other ten percent I'll probably waste."
-Tug McGraw
That's what I meant.
All the stuff I found on UZR said that average was effectively replacement, but I guess that refers to in general, not year to year.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
By the way, I voted for option 2 (I think, unless I accidently voted for option 3), but it took a lot of willpower not to vote 5)
Do you know who Justin is?
Just wondering… Google UZR if you want to find out how it works, there’s several articles on FanGraphs and similar sites.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
Here's his blog, I think.
http://www.basement-dwellers.com/
He also writes for Beyond the Boxscore from time to time.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
I think we all know that Fenway is not a park for UZR.
All the hits off the Monster fuck up Ellsbury and Bay’s stats. With that said, I don’t think our defense can’t still improve. Bay has been shaky, obviously SS, Lowell has lost a step, and Pedey and Youk have been mortal. I think the best thing to do is wait for it to come around.
"Hating the New York Yankees is as American as apple pie, unwed mothers and cheating on your income tax." -- Mike Royko
This is interesting but I am skeptical
The past two seasons I have begun to research more stats an try to keep an open mind. WHIP and OBP make sense and are a fairly realiable metric. However, I need some help. How can this stat truly measure objectively when even a cursory observation of Ellsbury reveals that he rarely makes an error, has excellent range along with good situational awareness. His only ding would be that he cannot throw like Dwight Evans. So, someone please explain his low score and how this metric is actually sound analysis.
"You take a team with twenty-five assholes and I'll show you a pennant. I'll show you the New York Yankees." - Bill Lee
You can't rely on one season to judge a player with UZR, that much has become evident.
Averaging 3 seasons seems to be the best approach, IIRC. Ells has issues, like the weak arm you mentioned, but he also may be playing in weird spots that are allowing more hits to drop. Fenway really seems to be UZR hell.
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
The weird thing is that Coco had such a great year in 2007 followed by such a bad year in 2008 in CF. It just does not make sense to me.
"Ninety percent [of my salary] I'll spend on good times, women, and Irish whiskey. The other ten percent I'll probably waste."
-Tug McGraw
Getting older?
More chances to expose the noodle?
OverTheMonster - ALLERGEN WARNING: May contain peanut butter.
That is a cogent statement (both average and the Fens factor)
I do think this – one way to judge a player’s value is when he is dangled in trade possibiblities. let be clear – I am not saying to trade ELLS – no way. What I am saying is he would most likely be an upgrade in CF for many clubs. He is a good all around player who is improving. What kind of metric that would entail is absolutely subjective…one thing for sure I am glad we have him on our team
"You take a team with twenty-five assholes and I'll show you a pennant. I'll show you the New York Yankees." - Bill Lee

by 
























