web analytics
<
Tag Archive - analysis

Post-Trade Deadline Performance by the Numbers

Wyn Douglas peaks in HP writing right around the deadline. But the rest of the year he writes for and manages Timberwolves blog Canis Hoopus and curates his personal site, complete with NBA Player Movement Flow Charts. Cut him some slack, the Wolves have been rebuilding for about 5 years and it’s tough to ensure you’re criticizing the right person for the wrong move.

When approaching the NBA trade deadline, there are 3 primary approaches for franchises to take:

  1. Get busy livin’. These are your Dallas and Cleveland types. While both have already made big splashes this year, both are also regulars in the deadline blockbuster scene. Those that get busy livin’ are the teams looking to add firepower for a championship run, rarely sparing an expense. Cap space? Draft picks? A contender craves not these things.
  2. Get busy dyin’. These are your Washington (at least this year) and Memphis (at least in 2008) types. As you can infer, they are the opposite of the first group. These are the losers smart enough to know that the guys they’ve banked on for the last few years ain’t gonna cut it and if they hold any value for the franchise, it’s as currency. The team will reluctantly hand over their above-average players, in talent and age, in order to clear money and time for the youngsters they have and hope to acquire. It’s not pretty, but in a league of guaranteed money, it’s often the best course of action.
  3. Don’t get busy. While passing on significant deadline activity is one approach, there’s more than one type of team that chooses it.
    • There are those who are just as happy to facilitate another team’s transaction as they are to do nothing at all. Let’s call these the Oklahoma Cities.
    • Then there are the franchises who think swapping Von Wafer for Taurean Green is worth the time it takes to file papers with the League office. A number of different teams are up to this each year, so I won’t single anyone out by sarcastically branding them.
    • The final group, however, are the true sad sacks that think it’s best to let a mediocre core that’s gotten you 1st round playoff exits for the last few years ride. Let’s call them the Old Washingtons.

Continue Reading…

NBA HD: A Look at At Rim Defense by Team.

Team shot location data has been the focus of my last two articles and since it’s largely uncharted territory, I thought I would continue down that path for the third part in the series.

My last piece that ran a few days ago looked at how teams shoot depending on the varying opposing defenses.  I found that generally teams attack the basket with more frequency as the quality of at rim defense worsens, but the degree in which they exploit that weakness was surprisingly small.  Teams averaged 25.7 shots at the rim against good at rim defenses which is one shot fewer than their per game rate against poor defenses.  Put another way, it’s also fair to suggest that good post defenses curtail shots at the basket. Moving further away from the basket, I did not see increased shot selection from 16-23 feet or from beyond the 3-point line.

Today, I’d like to turn the tables and tighten my focus to the most crucial part of the court: the basket.  How does each at rim defense perform against varying at rim offenses?   I broke down the offenses in the same manner that I did in the previous two pieces: top 10 teams classified as “Good”, bottom 10 “poor” and middle 10 “average.”  For the sake of consistency, I have maintained the color formatting from the last post.  So, if a team has a red color fill, that means they have allowed a low field goal percentage in that area which is the objective of any defense.  Red does not mean “bad”; instead, it merely indicates poor field goal percentage.   Let’s take a look.

Defenses can’t keep good at rim offenses from converting at a high percentage (.637) but interestingly, good at rim offenses shoot the fewest number of shots of the three groups (24.7 per game).  That good at rim teams do not attack the basket more often runs counter to the conventional wisdom.  Wait, I don’t think I have worded correctly.  Let me try again: good at rim teams do not attempt more foul-free shots at the basket.  That’s a key distinction.

At rim defenses, more than any other zone, are prone to deflated shot frequency due to fouls.   The biggest shot drop off comes from the Pistons defense that allows 7.4 more shots at the basket when facing a poor offense compared to a good one.   That’s a 27.5 percent decrease.  As you would expect, the Pistons foul a lot as their opponent Free Throw Rate (FTA/FGA) ranks sixth highest in the NBA.   So it’s probably more accurate to suggest that good at rim squads do attack the basket frequently but the whistle hides a good portion of those attempts.

The Pacers feast on poor at rim offenses more than any other good defense, with an 80 point swing from good to bad offenses while still maintaining a below-average opponent field goal percentage in all three.  Like the Pistons, this could be a result of their high fouling rate, as they rank as the second most foul prone team according to FTR.   Good offenses get to the line whereas poor ones fail to draw the contact that would alleviate their missed shot count.

Actually, all of the top defenses rate well across the board here– except for the Lakers.

The Lakers plummet 14 spots in the ranks  going from poor offenses to good offenses as their field goal percentage skyrockets from .549 to .656.  It could prove fatal in the playoffs once you consider that  most playoff teams excel at the basket.  Although, the Lakers did stop the Spurs recently last Monday, holding Tim Duncan and the rest of the Spurs to just 16-34 shooting at the basket, which serves as a sharp contrast to their 18-24 FG showing on January 12th.    This will be something to watch as the season goes on.

In the sheer absurdity department, good at rim offenses have scored 70.6 percent on the Kings this season.  Sacramento can place a lot of the blame on San Antonio and Atlanta.  In their three contests, the Spurs have shot 69 for 88 at the basket, which is a staggering 78.4 percent success rate.   Atlanta has done even better: making 42 of 52 (80.8%).

As always, you can find exclusive shot location sortable breakdowns at Hoopdata.com.

Evaluating The Playoff Race At All-Star Break Through Four Factors: Eastern Conference

We’re at the halfwaypoint, and some things we know, and some things we think we know. We think we know how good the Cavs, Lakers, and Nuggets are. We know how bad the Nets are. We know that the Western playoff race will likely come down to the end. We know the Hawks have the Celtics number, and the Magic have the Hawks number, and everyone has Golden State’s number. What we don’t know is just how good the playoff teams are, and if their record is befitting their performance. So I decided to take a look.

Continue Reading…

NBA HD: Analyzing Shot Location Matchups by Team

Last week, for my debut article here at Hardwood Paroxysm, I used shot location data from my mother site Hoopdata.com to see whether good shooting teams from a particular area from the floor enjoy an advantage against weaker foes in that shot location.  I found some compelling results; teams who dominate around the rim have generally shot better against weak post defenses, and vice versa.  I also discovered that a good three-point shooting team has not exhibited much of an improvement between facing a poor 3-point defense and an average defense, which was the only area to show that type of diminishing return.  So a team like the Suns, who terrorize opponents from downtown, may not see their deadly shooting talents magnify against a poor three-point defense.

My findings provided enough juice to take this a step further.  This time, I’d like to amp up the focus and look at specific teams this year.  I replicated the method I used last time and grouped the teams in tens depending on their opponent field goal percentage in each shot location.  I tossed the top ten teams into the “Good” pile, the bottom ten in the “Poor” pile and the rest in between were classified as “Average”.  You can find all this good info in the team shot location pages over at Hoopdata.  If you’ve read my work before, you know I really, really love adding colors to charts and graphs.  Each color has been formatted like a heat map to correspond with their relation to the group norm.  Looking at the FG% column, green fill illustrates a team’s excellent standing in that particular area and the opposite holds for red fill.  As it follows, yellow fill indicates a number close to the norm.  For the FGA column, the gradient ranges from yellow to burnt orange.   The colors should help you identify the extremes and digest the numbers faster.  The following chart shows how each team performs at the rim (layups, dunks, and tip-ins) when facing a good, average, and poor defense in the same area.

We’ll start with the Cavs.  They don’t miss at the rim, no matter who they face.  Led by LeBron James’ 73.8 at rim FG%, the Cavs shoot better at the rim against good defenses than 21 teams fare against poor ones. They shot 19-24 vs. the Lakers on Christmas Day.  In their two games against the 5th-ranked Pacers at rim defense, the Cavs nailed 30 of the 39 attempts at the basket.   LeBron’s finishing abilities are contagious in the sense that he pulls help defenders toward him as he drives, opening up easy buckets for his teammates cutting to the rack.

Looking elsewhere, the Raptors and Spurs are the only other two teams who finish above-average against all three groups while the Bobcats, Nets, and Bucks struggle against all defenses.  The Celtics wilt before top at rim squads but otherwise, they finish with gusto.  A lot of their problems at the rim would be solved if they never have to face the Magic.  In three of their four games against the Magic this season, the Celtics have missed 42 of their 73 chip-ins.  Granted, the Magic have the best post defense in the league but we’re talking layups here.

Do teams take more shots against poor defenses?  Not many, if at all.  On average, teams shoot 25.7 shots at the basket against good post defenses which is only one fewer than the rate against lesser defenses.   So the difference is marginal on the whole.  Interestingly enough, the Lakers don’t seem to be taking advantage against the weaker teams and actually take fewer shots at the rim in those scenarios.  Their .610 field goal percentage against poor defenses ranks 6th-worst in the NBA.  Kobe Bryant, who shoots 57.5 percent at the rim,  shares some of the blame for the Lakers woes.  He is actually a below-average finisher at the basket compared to his shooting guard positional comrades who average 59.8 percent.

Moving away from the basket, let’s take a look at how teams shoot on long twos.

Despite experiencing a sizable 22 point advantage in field goal percentage, teams don’t really look to take more long twos against poor perimeter shooting teams; teams average 0.4 fewer long twos against poor defenses than their portion against good ones.   It’s harder to distinguish between a good defense and a poor one in this range because the spread is so small.  The 10th best long two defense allows 40.2 percent whereas the 10th worst allows 38.6 percent.   Not a huge difference.

Nonetheless, some teams have really struggled against good perimeter teams.  Houston plummets from a healthy 42.2 FG% down to a league-worst 34.6 FG% as the going gets tougher.   Although, their distaste for long twos keeps that damage to a minimum; they take the second-fewest long twos in the game

It’s hard to fathom how difficult it is to shoot nearly 50 percent from this range but the Mavericks somehow manage to do it against bad perimeter defenses.  Led by long two resident Dirk Nowitzki, the Mavs have shot 50 percent or better 17 times this season from that area.  The Bulls?  Three times.  And they take average eight more heaves per game.

Let’s move along to our final destination: behind the three-point line.

Just as we saw with long twos, teams don’t tend to take more threes against poor 3-point defenses.  In fact, teams on average attempt more shots beyond the arc against good defending foes (18.3) than the average (17.3) and poor (18.2) opponents, despite knocking them down at a higher rate.

The Cavs are on top of their game in the most efficient areas on the floor.  Just as they were unfazed at the rim, Cleveland shoots well from downtown no matter who they face.  Not all 3-point shooting teams weather the storm like the Cavs however.  Contrast the Cavs with the Suns, another sharpshooting team, whose three point shooting numbers fall from .428 to .379 as their opponent improves.   Count Denver in that group, too.

With a .454 field goal percentage beyond the arc against poor 3-point defenders, the Spurs effectively shoot .681 with the added one point bonus.   That’s an amazing figure considering it’s more than 100 percentage points above the rest of the league, in terms of effective field goal percentage.  If the perimeter exploitation continues, the Spurs hope to draw either the Suns or Mavericks  come playoff time as they both rank among in the bottom ten in 3-point defense.  Otherwise, the Spurs are a below-average shooting team from beyond the arc.

While teams enjoy an advantage against the different quality defenses, they don’t launch more from the perimeter depending on their opponent.  Since the likelihood of nailing a shot from downtown is much smaller than at the basket, the perimeter ranks may be more random variation than true representation of strength.

Curious about the short and mid-range numbers?  Take a look at them here and here.  I saved them for the sake of  space but they’re definitely worth a look.

The Arbitrarian: Envisioning the Olympics

David Sparks is The Arbitrarian. He profiles his stastical work every Thursday here at Hardwood Paroxysm. David is glad to be back at school, especially with his new Trapper Keeper and abacus. This week he takes a look back at Olympic Basketball and the ramifications the numbers supply within.

Admittedly, it’s a little late for Olympic basketball coverage, given that the competition ended sometime around 4:00 am EST on Sunday morning, but Thursday is Arbitrarian day, and so today I’m going to try to tell the story of the US Men’s Olympic basketball team retrospectively, in statistics and graphics.

Predecessors

This most recent iteration of the US men’s basketball team was slated to “redeem” the American program in international competition. After several successive failures to dominate their competition, much was made about the degree to which the rest of the world had caught up to the level of American basketball and/or how the American players, because of would no longer be able to dominate in international competition. Several of the more recent US squads were derided as selfish, non-fundamentally sound, failing to take international competition seriously–the narrative was one of how hubris could lead even the mightiest to fall.

It has been said that during those dark years, the US was “just fielding all-star teams,” and that part of Jerry Colangelo’s plan for a return to dominance was to field carefully constructed teams, with role players and specialists–not just 12 guys who could score. To what extent is this true? How much credit does Colangelo’s craftsmanship deserve? As we like to do here, let’s take this subjective claim, and apply a little bit of rigor to see if it holds up without the patriotic feelings and stirring redemption narrative clouding our judgment. For answers, let us look to an application of the SPI style trichotomy:


(Note: If you turn captions on (second button from left on bottom), each diagram is labeled with its year. Also, hit pause and use the arrows to review each image at your own pace.)

Above is a series of graphics depicting the SPI styles (based on their NBA statistics) of each team fielded by the US in major international competition, from the Dream Team in 1992, to this year’s “Redeem” Team, with the exception of the 1998 World Championship team, which was largely composed of non-NBA players.

What differences can we identify in each team’s composition? Did Colangelo really put together a thoughtfully composed team? It appears to me that this was at least some part of the difference between this year’s team and those recent teams that ended in failure and disappointment. The main thing I notice, in comparing the 2002, ’04, and ’06 teams (although especially the first two) to each of the others, is a relative dearth in the pure perimeter region.

Each of these teams has an eclectic smattering of interior types–some years they appear more offensively-minded than others, and the 2008 Olympic team, interestingly has only three players classified as such in the SPI scheme. But look first at the 1992 team, which is stacked to the gills with players in the 10 o’clock to 12 o’clock range, meaning that their statistics indicate a focus on perimeter play, or an absence of focus on scoring, relative to the league. Such is the case, to a slightly lesser extent, with each of the other teams up through 2000.

In 2002, the perimeter appears to have become less of a priority, stocked with
Andre Miller, Davis, and young Jay Williams–good players, but not the “pure point” types which manned some of the other teams. Further, that team was full of Perimeter Scorer types, three of which (Reggie Miller, Finley, and Allen), are known more for their shooting than their all-around game.

2004 may have been an even more poorly-constructed team, with essentially no Pure Perimeter players. James and Wade are capable of facilitating, but this is not typically their primary role, and James played relatively few minutes anyway. Instead, that role was left mainly to Marbury and Iverson, who are known to look for their own shot as often as they pass–and this subjective reputation is backed up by the SPI analysis.

The 2006 team was much better–it is obvious that effort was made to compose a team of players of many different types–this is the only year in which there is at least one player from each sextant of the SPI plot. This is not necessarily a good thing for winning, but it indicates that thought was put into how each player would fit together into a whole. Further, two actual perimeter players were included, Paul and Hinrich, and this team performed substantially better than their Marbury- and Iverson-lead predecessors.

This year’s team sees a return to past glory, likely in no small part to a fully-stocked trio of Pure Perimeter players, able to push the ball up court and facilitate any of the able scorers on the team. Interior play was de-emphasized, as the team’s focus would be on a disruptive defensive style aimed at generating turnovers and leading to fast breaks–for this, speed, not size, was key.

In sum, it appears as though part of the credit for the USA’s Olympic success really might belong to Mr. Colangelo. Though it is the players on the floor who do the actual winning and losing, a large part of the results likely stemmed from what happened way before the opening tip.

Now that we have covered the pre-Olympic preparation phase, let us turn our attention to what actually happened in Beijing.

Assessing productivity in these Games

Due to limitations on the ease with which game-by-game data can be collected for the Olympic tournament, I will be discussing productivity (as measured by MEV) rather than value (as measured by MVP)–but here, the story is pretty clear.¹ Below is a list of each athlete, with their SPI factors, points- and MEV-per game numbers, and Valuable Contributions Ratio. I’ve also included what I call Points Per Points Possible (p4), which divides points scored by the number of points possible on each of their shot attempts (2 for all field goal attempts, plus an extra one on three-point attempts, plus one for each free throw attempt).


Many of the most productive individuals play professionally in the NBA. These numbers indicate that LeBron James was the most valuable to Team USA, but note that Wade was almost as productive in substantially fewer minutes (his VCR is the highest on the US team). As such, I have to name James the MVP (for the team and the whole tournament), but Wade is the US’s Most Efficient Player, which is exactly what the team needed from its first man off the bench.

How did contributions break down for each team? Below is a series of charts that plot the sources of production for each team, based on tournament-cumulative MEV. Each player is colored according to their SPI type, and players with negative MEV are zeroed out (because it’s hard to depict the area of a negative number):


Click here if you want a whole window full of these pie charts.

Among the best teams in the competition, Argentina was more highly dependent on their top-tier players than were Spain and the US. The two teams most reliant on a single player were China, anchored by Yao Ming, and Iran, lead by Ehadadi. Croatia appears to have had the most balanced contributions, although this is often a trait of weaker teams, because it is easier to field a team of equally poor players than one of equally excellent players.

What did each player produce individually? The table above gives the summary report of the points-value of each player’s production, in the form of MEV. Below, however, I have the complete breakdown of each player’s counting statistics for the Olympic tournament, as a percentage of the simple sum of these stats for that player. I have tried to arrange the graphs such that adjacent areas make for easy comparison of paired statistics–missed field goals is next to points, assists next to turnovers, offensive and defensive rebounds together, followed by the defensive statistics, etc. Players are sorted by MEV/gp. Coloration is of course derived from SPI type based on Olympic statistics.


Click here if you want a whole window full of these little pie charts.

Seeing these pie charts all together as small multiples allows us to easily compare two or more players. Note, for example, that Dwight Howard and Chris Bosh were almost perfect substitutes for one another: they have almost identical per-game MEVs, and their stat distributions look very similar–the only exception seems to be that Bosh seems to have grabbed relatively more defensive rebounds and turned the ball over slightly more, while Howard did a lot more fouling.

Carlos Delfino’s SPI color identifies him as a very tournament-representative player; that is, his relative distribution of scoring, perimeter, and interior statistics reflect that of all players collectively. The gray color indicates this league-relative neutrality, and he serves as a useful benchmark against which to compare others.

As is evidenced by his orange color and large segment devoted to pts and fgx, a large portion of Bryant’s statistical contributions came from scoring. However, these statistics likely do not give the full picture for Bryant, as his role for most of the duration of the tournament was to shut down the opposition’s best players, not unlike a “Doberman.”

Jason Kidd (very pale blue, about halfway down) is one of few players for whom pts is not the largest segment. Rather his defensive rebounds and assists took priority, although so too, unfortunately, did his turnovers and personal fouls.

Michael Redd (rusty color, much closer to the bottom of the list) offers an interesting example of the usefulness of such a visualization. The first thing one notices is that his pts sector is matched in size by his fgx sector–he missed almost as many baskets as he scored points. Tip for the uninitiated: this is not a productive way to play basketball.

Another way to look at the data is through parallel coordinate plots, which are useful for depicting the rank of an individual across multiple categories. Below, I present PC plots for each member of team USA, where the vertical axis indicates that individual’s rank in each of 9 metrics, relative to the entire pool of Olympic players. On each plot, for ease of comparison, I draw gray lines for the remainder of the US team, but highlight each player individually in their SPI color.


Click here if you want a whole window full of these parallel coordinate plots.

p4 is Points Per Points Possible, described above, AS:TO is the assist-to-turnover ratio, TR/min is total rebounds per minute, DEF:PF is (BK+ST)/PF, which is just an amateurish way of measuring defensive skill.

Looking at these plots, we can see that Wade performed very well. He is in the top four on the US team in each stat, and it is apparent that he is in the top half across the board among all Olympians. Redd, although he was called upon to provide a shooting spark off the bench, was mostly a dud, with a p4 among the lowest in the competition. Bryant was second lowest on the team, but his shooting efficiency looks to have been better than about a third of the Olympic players, and thus much better than Redd’s. Note that due to a small sample size, some of these ranks will appear odd, namely Redd’s high ranking on the DEF:PF statistic and Kidd’s high p4 rating. Neither of high rankings are what we would expect from these players, but Redd played relatively few minutes, and Kidd only took shots he couldn’t refuse to take, resulting in good ratings for in these areas over a small number of observations.

I would be very interested to hear any more insights you glean from the above displays–feel free to copy any of the charts for your own use, just also please provide a link back to HP.

Olympic style

We’ve seen the NBA styles of the players that make up Team USA, we’ve seen their SPI factors and even their specific statistical breakdown. Now, we turn to a full SPI Spectrum graphic depicting each Olympic competitor, and their type, based solely on their production in the Olympics. Player names are scaled according to their MEV totals, so that the most productive players are the easiest to spot.


Fullscreen Version

Several things stand out to me. First, I am impressed by the degree to which this Olympics-based diagram matches up with the NBA-based diagram, for players who appear in both. Redd, Bryant, Williams, Kirilenko, Howard, Yao and Boozer all played similar styles in these Olympic games as they did in the 07-08 NBA.

Even more enlightening are the differences: Louis Scola played much more of a scoring role for Argentina than he does for the Rockets (understandably so). Dwyane Wade and Chris Paul shifted their focus away from scoring, relative to their NBA style, likely because they were not required on this team to carry their team’s point production. Anthony’s purported focus on rebounding is reflected in his shift from a somewhat perimeter-biased Scorer to an Interior Scoring type. Jason Kidd became an even more extreme Scorer’s Opposite, eschewing shooting opportunities whenever possible.

The most significant shift, however, might be seen in the play of LeBron James. Last season in the NBA, James lined up at about 12 o’clock on the diagram; the style with which he most closely aligned was Perimeter Scorer. In these Olympics, however, James’ style reflects his commitment to doing whatever was needed by the team. His minty-green color and placement at a little before 11 o’clock reflect his Pure Perimeter style, though his relative proximity to the center of the diagram indicates that his fit here is not perfect. Rather than being the primary scorer for this team, as he is accustomed to being in Cleveland, James stepped up the defensive intensity, leading his team in blocks (with eight), and finishing second in the tournament in steals (with 19!), not to mention leading the tournament, by a landslide, in menacing scowls. Further, he was second on the US team in assists (30; Paul had 33), his assist-to-turnover ratio was a respectable 1.76, and he finished in the tournament top ten in total rebounds. To put it in perspective, the role James filled for this US team was similar to that played by Magic Johnson on the showtime Lakers, which is quite a niche, indeed.

Conclusion

In sum, we can see that at least some of the hype is true. There has been some well-placed cynicism regarding the extent to which the “Redeem Team,” and our collective impression thereof, is a product of marketing. I have no doubt that at least some of what we believe about this team and its players is fabricated for the purpose of generating a positive image, and greater sales. However, at least two claims made about this team can be empirically verified, and I have tried to do that here.

The first claim is that this team is different from the failures which came before. Using NBA statistics and the SPI Typology, I am inclined to believe that in construction, this team is different than its three previous iterations, and more similar in design to the Dream Teams of the 1990s.

The second claim is that the players on this team changed their styles to accomodate each other, to better fit together as a team. Comparing SPI positions in the Olympics to SPI positions in the NBA, we can see which players had similar statistical distributions, and those which modified their style. Each player on the US team was either accustomed to or able to lead their NBA teams in scoring on any given night, and in Olympic competition, this ability to rely on others to score allows (at least theoretically) unselfish play. The question was always whether or not this team of able shooters would be able to “put aside their egos” and fill a specific role for this team, which may or may not include a substantial amount of offensive production. By and large, it appears as though the players asked to do so have responded positively. Though several US team members played with styles similar to their NBA styles, this reflected the reported desire of the coaching staff and management of the team (i.e. Michael Redd is supposed to be a shooter). Other players saw drastic shifts in their style of play, especially movement away from a focus on scoring, as a universally capable offense permitted each individual to do less of the shooting than may be required on their NBA squads. Based on this graphical evidence, I am willing to advance a tentative rejection of the null hypothesis that the players did not fill the roles they were asked to. Rather, it appears as though they played as a cohesive unit, maximizing their strengths and possibly sacrificing for the team.

I hope this late coverage was worth waiting for. I would be very interested in hearing your reactions to any of the ideas I’ve put forward, and I would especially like to know if you see any interesting relationships jump out in any of the SPI diagrams. I haven’t even begun here to discuss the interesting similarities between several of the international players and those from our own NBA in the Olympics, I suppose I will leave that to you. As usual, I’d love to hear from you in the comments, and in the survey, and please Buzz this up!







¹ If you are particularly interested in game-by-game contributions and value, I did track a modified version of MVP for team USA throughout the Olympics and pre-Games warmups. You can see the per-game and cumulative results here.

The Arbitrarian: Individual Contributions To Team Success

Note for those viewing this post in a feedreader: today’s Arbitrarian is very graphics-heavy, and the images won’t show up in the feed version of the post. If you want the full experience, I suggest clicking through to see the original at HP.

Last week, we looked at player productivity, as based on box score output. Today, we’re going to look at value, which, as you will see, is somewhat different than productivity.

The MVP is not necessarily the best player in the league, nor the most efficient. Many times, the MVP award goes to the player widely considered to be the best player on one of the better teams, but when it comes to arbitrating between comparing the best players on several of the best teams, there appears to be no hard-and-fast rule, and subjectivity enters into play. Today, I will propose that value ought to be quantified in terms of individual contributions to team success, where success is measured in wins.

If we can estimate the number of wins for which each player is responsible, we can do away with the arbitrary focus on only the best few teams. It is theoretically possible, for example, for the most valuable player to be an absolutely dominant but lonely contributor on a middling team, while the better teams each have enough decent players that no single one can be credited with a large portion of their success. We are still, however, left with the problem of objectively measuring each player’s contribution to team wins. To do this, I’d first like to explore…

A non-basketball thought experiment

Imagine a lemonade stand owned and staffed by Xavier, Yvette, and Zach. They make money by selling home-brewed lemonade at the end of their cul-de-sac, and only one of them staffs the stand at any given time. After their first month in business, they look at their lemonade sales revenue, and try to figure out which salesperson deserves what part of the income. One option would be to split the revenue into thirds–three employees, three parts. Zach claims that such a distribution is unfair because he worked over half of the total number of hours, while Yvette and Xavier worked about a quarter of the hours each. He claims that the distribution should thus be more like (1/4, 1/4, 1/2).

Xavier points out, however, that if they are trying to assess each employee’s value, they should try to find a more specific measure of actual revenue generated by each seller. He suggests that, since revenue is generated by lemonade sales, revenue generation should be measured in terms of the number of lemonades sold by each employee. Since they kept detailed records of such numbers, this is easy to calculate: Xavier sold 2/5 of all glasses, Yvette 1/2, and Zach just 1/10. Zach is disappointed that his pay-per-hour gambit was foiled, but must concede that this arrangement is more just–Yvette and Xavier are much better salespersons, and did more to help the company make money, while Zach mostly daydreamed during his hours on the job.

Back to basketball

What I have in mind is the application of a similar methodology to basketball. We have an excellent estimator of aggregate value–team wins; and credit for these wins can be apportioned to the players who work for those wins. There are a plethora of ways this could be done–we could arbitrarily estimate credit for each player on each team: The superstar might get 50% of the credit, the rest of the starters get 10% of the credit each, while the remainder is split amongst the bench players. Perhaps we could look at minutes played–after all, ceteris paribus, removing a mediocre player, and replacing him with a better player for the same number of minutes, should result in a greater number of team wins. Similarly, increasing the number of minutes played by a good player (to a point) should increase wins, while increasing minutes played by a bad player should lead to fewer wins.

This method isn’t foolproof, however: certain high-minute players might be daydreamer-types like Zach in the example above, while others might be feverishly productive. Consider, for example, Matt Carroll versus Yao Ming in 2007-08. Both played roughly the same number of minutes (2016 and 2044), but Yao was substantially more productive (by almost any measure) than was Carroll in that amount of time. Estimates of their value should reflect this difference.

Instead of minutes, I have chosen to use Model-Estimated Value (or MEV, discussed here) as an estimate of player productivity. There are several advantages to this choice, but two stand out. First, as discussed previously, MEV is a good estimator of per-game productivity, and so is more helpful to us than looking at, say, games played, minutes played, or points scored alone.

The second advantage comes from the fact that MEV does not perfectly capture player value. If it did, then team-level MEV would correlate perfectly with team wins, and we would not need separate measures for productivity and value. Rather, since some aspects of player value are omitted from the box score–things like defense, effort, intensity, etc–we may scale our MEV productivity estimates by team success, which does implicitly measure all of each player’s contributions.

Bruce Bowen, for example, had a 07-08 per-game MEV of 5.60, which put him below, among others, Wally Szczerbiak. Many would cite this as an example of the failings of MEV–its inability to fully measure defense (not to mention a lack of adjustment for playing time and pace) leads to an undervaluation of players like Bowen. However, Bowen’s defense does show up in the Spurs’ success–no small part of their winning can be attributed to his contributions. Similarly for Szczerbiak–his contributions are reflected in the success had by Cleveland and Seattle–that is, relatively little success. Thus, by crediting players for team success, using MEV as our measure of productivity, we may get closer to measuring each players’ actual value.

This method is still not perfect. We might still be undervaluing Bowen’s relative contribution to the Spurs, and overvaluing Szczerbiak’s contribution to his teams. However, given two players with idential MEV numbers on teams with otherwise identical rosters, the player whose MEV is “worth more” will help his team win more games.

Calculation and results

The measurement of each player’s value to their team is straightforward. Merely take each player’s season total MEV for a given team, and divide it by that team’s season total MEV. This gives us a metric I call Percent Valuable Contributions, or PVC. For Kevin Garnett in 07-08, this calculation takes his season total MEV (1,534.19) and divides it by that of the Celtics as a whole (8,282.62), resulting in a percentage (expressed as a decimal): 0.185. This means that Garnett is responsible for 18.5% of the Celtics’ success, which is a rather large portion, indeed. From here, estimation of value is very easy. Simply take this PVC number, and multiply it by team wins. This gives you each player’s BoxScores (BXS), their individual contribution to team success. The first tab on the table below depicts the numbers that go into the BoxScores calculation for each player in the 2007-08 season.



I’ve sorted each player by their PVC for the sake of comparison. In terms of value to their team, the top three players are James, Paul, and Jefferson. All three are good players, but certainly Jefferson is a step below the other two. Since BoxScores accounts for team success, we can clearly see Jefferson’s actual value is much less than that of James and Paul–he might be the most valuable player on the Timberwolves roster, but such is not a high distinction.

For more insight, note the series of players whose PVC comes it at around 0.185: Steve Nash, Joe Johnson, Kevin Garnett, Richard Jefferson, and Carlos Boozer. Even the casual fan knows that these players are not all equally valuable, though they may be equally valuable to their respective teams. Note that Nash and Boozer generated a much higher MEV total than the other three–this is largely because, as the table also shows, Phoenix and Utah had much greater team MEV totals, thanks to a faster-paced playing style. Team wins complete the picture–Richard Jefferson was responsible for 18.5% of his team’s 34 wins. His value is thus estimated at 6.29 BXS. Kevin Garnett was responsible for 18.5% of his team’s 66 wins. His value is thus 12.23 BXS. As you can see, by accurately measuring productivity (MEV), and accounting for team success (wins), we are able to objectively assess each player’s value (BXS).

An aside into rated productivity

Another useful measure, especially for comparing players on poor teams, or those who played limited minutes, is what I call the Valuable Contributions Ratio (VCR). This is a pace- and playing time- adjusted metric of productivity assessed at the per-minute level. As above, this calculation is straightforward and intuitive. Merely take each player’s PVC (MEV/team MEV) and divide it by each player’s percent of team minutes played (min/team min). Thus, we are dividing a percentage by another percentage (which is why I call it a ratio–units are somewhat meaningless). This statistic controls for team pace and playing time, and is independent of team quality–it captures productivity relative to the time allowed for production.

This is useful for comparing bench players, players who miss a substantial number of games, and rookies. Bench players get a “fair shake” by this statistic, because they often have less time on the floor in which to accumulate MEV toward a larger cumulative share of team success. Same for injured players–Andrew Bynum did not play very many games for the Lakers in 07-08, and as such was less valuable in terms of team wins. However, when he did play, he produced very efficiently, with a VCR of 1.36. (This means that he was responsible for 1.36% of his team’s production for every 1% of team minutes played–which is very efficient.) VCR is useful for comparing rookies, as well, since they often play relatively few minutes, and since their teams often win very few games. Rookies with high BXS are the most impressive, but more often than not, rookies don’t produce many wins. Rather, they may produce MEV efficiently, and we can see this in VCR. Among rookies with substantial playing time in 07-08, Carl Landry produced the most efficiently, with a very respectable VCR of 1.39. (The Arbitrary Rookie of the Year, Kevin Durant, was 7th among rookies by VCR, and 5th on the rookie BXS list after Scola, Horford, Moon, and Thaddeus Young. He did, however, lead all rookies in points per game. WoW Club!)

BXS MVPs and all-time greats

Look again at the table above, but this time, select the second sheet, titled “07-08 BXS.” This lists each player from the 07-08 season, on each team for which he played, and includes measures of productivity (PPG, and MEV/gp), efficiency (VCR), and value in terms of BXS. As you can see, the obvious most valuable player in 2007-08 was Chris Paul, who was responsible for over a quarter of his team’s 56 wins, for a BXS of 15.41. Kobe Bryant, this year’s Arbitrary MVP, had a good showing as well, but was responsible for almost three fewer wins than was Paul. Kevin Garnett, who was thought to be a more subjective favorite for MVP, acquitted himself nicely in objective terms, by generating 12.23 wins even while missing eleven games.

For a more historical perspective, see the third tab, “BXS Seasons.” This lists the same information, but for the 500 most valuable seasons from the population of every professional basketball season since the beginning of the NBA, even including the ABA. Unsurprisingly, Chamberlain tops the list, although his most valuable season was not his most productive in MEV terms. Shaquille O’Neal’s dominating performance for the incredible 67-win 99-00 Lakers is the most valuable season in recent memory, followed closely by Jordan’s post-first-retirement 72-win season in Chicago. Perhaps surprising, but perhaps not to those who have always appreciated the Big Ticket, is Kevin Garnett’s extremely high value in 03-04. Always a valuable player to his team, Garnett and the Timberwolves finally put it together for one great year, and Garnett’s relative value (PVC) translated to absolute value (BXS).

The final tab, “BXS Careers,” accumulates the performance of 500 NBA greats. The table is sorted by BXS82, which is the number of wins each player would be expected to produce in 82 games played, given his career performance. There may be a few surprises, but they are instructive: The first is Alex Groza, who was a great player in the early years of professional basketball, but whose career was cut short. The second surprise might be the ordering of Michael Jordan, relative to Magic Johnson and Tim Duncan. Many fans and observers would identify Jordan one of the most, if not the most, valuable player ever, and here he ranks sixth at a per-game level. The first thing to note is that Duncan has always been more valuable than his box score statistics might indicate, and this is reflected in his BXS measure. Secondly, Duncan has not yet seen his productivity or value decline substantially due to aging. For the most part, Duncan’s 13.73 BXS/82 average comes from the peak of his career. Johnson retired after a relatively short career, and his comeback in 1996 was brief. Jordan, by comparison, had a second comeback for Washington during which he played 142 games of much less valuable basketball. If you exclude Jordan’s Washington years, his career BXS82 becomes 14.62, which puts him solidly above the other two.

Visualizing value

Now for the first graphical visualizations in the life of this young column. Since BXS is derived by multiplying player contributions (PVC) by team success (team wins), we can envision BXS itself as the area of a rectangle with sides of PVC and Wins. This lends itself to graphical expression, with the league as a rectangle, 41*30= 1230 wins wide, and 100% high. Partitions may be made on the horizontal axis for each team, scaling each section by that team’s number of victories. Within each team’s segment, further divisions can be made for each player, according to their contribution to team success. The best explanation is an example, displayed below:


Fullscreen Version

You may use the controls at the top left to zoom in and pan across the graphic to see more detail, or an expanded overview, as you wish. The “Fullscreen Version” link directs you to a much larger version of the same display, which may be easier to grok. The graphic above displays BXS for the 2007-08 season, with team success increasing from left to right, and player contributions increasing from bottom to top. Colors are derived from statistically-derived playing style, where red indicates a propensity for scoring, green denotes perimeter play, and blue highlights interior-play tendencies. Much more will be said about this classification scheme next week.

Several things I’d like to point out in the graphic above to get you started: First, note how tall the rectangles of Chris Paul, LeBron James and Al Jefferson are–this is because height is scaled according to PVC, and these three players were the most responsible for their teams’ success.

Looking across the top row of the graphic, we can identify each team’s Most Valuable Player. For the lowly Heat, Dwyane Wade was most valuable, despite injury. Calderon was most valuable in Toronto (7.1 BXS), though Bosh was a very close second (7.0 BXS). The most valuable player on the best team was Kevin Garnett, but since he had a very supportive team behind him, his individual value was somewhat less than that of Chris Paul, whose supporting cast drops off substantially in terms of contributions after Stojakovic.

One useful perspective granted by displaying contributions in this manner, is that it is easy to compare units across teams. For example, Boston was famed for the Big Three of Garnett, Pierce and Allen. Using the scale on the right of the graphic, we can see that together, these three accounted for almost half of Boston success. Looking across the graphic from the lowest part of Allen’s rectangle, however, we can see that the big three that was most valuable to their team can actually be found in New Orleans, where Paul, West and Chandler can be credited with almost 60% of the Hornets’ success. On the other side of the coin, Detroit, Houston, and Chicago all got a fairly balanced set of contributions, as their subjective reputations might have suggested. I would be very interested to hear about your own observations, as well as your opinions as to how well this graphic meshes with your subjective opinions, in the comments.

I’ve also developed an interactive presentation of the graphic above, with even more detailed statistics. Just follow the link below to the Interactive BoxScores Explorer page. The league-wide graphic has been scaled to fit in your browser window, and players’ statistical details pop up on mousover. Try it–it’s somewhat addictive.

Make sure to click around a little bit–I’ve created “player cards” for each individual, which display even more detailed statistical information, including their playing style, most and least similar players, the mean and standard deviation of their “counting” statistics, and a season-long sparkline of their productivity. Feel free to use them in any application you wish.

The player cards are a quick and easy way to quickly assess any player. For fantasy purposes, for example, if you’re comparing two players with similar averages in assists, you might want to pick the player with the smaller standard deviation about that mean, as indicated by the error bars in the middle section of the card. Alternatively, if you are interested in whether a certain player tends to produce more as the season goes on, the seasonal trends should give some insight into this, as well as how long it takes the player to recover fully from injury, or how much they produce when their minutes go up. Also, I’ve included each player’s most- and least- similar match, based on the 07-08 season, which can help give you an idea of the niche they fill on their team.

Historical BXS franchise timelines

Another excellent use of this BXS area diagramming visualization is to display franchise histories. Better years are represented by wider segments, and the best players rise to the top with tall rectangles. Eras can by identified by patterns in color. Here are two examples, the first depicting the LA Lakers franchise, and the second, Boston Celtics history:


Fullscreen Version

Several eras stand out in the graphic above. The Mikan era was eventually replaced by the Baylor/West dynasty which became the Chamberlain/West years. Notice, incidentally, how West becomes “greener” over the course of his career, indicating a shift away from focusing on scoring, and toward a concentration on other perimeter contributions. A Kareem era follows, though his best years are at this point behind him, and massive team success comes only with the addition of Magic Johnson. Johnson leads the Lakers for ten consecutive years and his retirement marks the end of an era of dominance. LA returns to form in the late 1990s in the hands of O’Neal and Bryant, who turn in some incredible performances–interestingly, there is an obvious breakpoint between 2001 and 2002 on the graphic indicating the switch from the Lakers being “Shaq’s team” to being “Kobe’s team.” The 2006 version of Bryant was forced to carry the scoring load to a massive degree, but the 2008 version (as is evidenced by a much less red color), has been freed up to focus less on point production, and more on doing other things to help his team win. Perhaps the 08-09 season will see Gasol and Bynum float to the top of the column, turning in full, healthy seasons for a very successful LA team.


Fullscreen Version

Boston history is marked even more clearly by the careers of its greatest players. The Celtics of the 1960s are consistently topped by Bill Russell’s defensive-interior blue, and bolstered by some great scorers, like Havlicek and Jones. The 1980s saw a parallel to the Lakers above, in which a perimeter player (here Bird) lead a team supported by strong interior scorers. The 1986 and 1987 Celtics offer an interesting starting lineup of all greens and blues–no single player was responsible for most of the scoring, while other types of contributions were made by all. After years of seasons with little success and narrow columns, the Celtics finally turned it around last season with the addition of Garnett and Allen, almost tripling 06-07′s win total.

The final graphic below offers an alternative take on presenting BoxScores, by tracing the careers of each of 50 NBA greats. Following the peaks and valleys of each player’s tenure, we can also see the years in which many stars were shining brightly. 1972 was a great year for the sport, as was 1990–many of the NBA’s greatest players had good seasons in these years, and the league may be seen to peak at these points. The graphic makes it possible to see the beginning of new eras–witness the start of Bird’s and Johnson’s careers, followed shortly by the rookie seasons of Thomas, Drexler, Jordan, Olajuwon, Stockton, and Barkley. We can see a big dip in strike-shortened 1999, and then another in 2004. This second dip may be troubling–has the quality of the league declined so sharply? Worry not–many modern greats retired in the years just before 2004, meaning that their layers drop out of the picture, setting the stage for the new era of NBA stars we are witnessing today.


Fullscreen Version

Conclusion

As always, I’d very much like to hear your opinions of BoxScores as a measure of value, as well as whether or not you think it gets things right. Was Magic really at his peak in 1987? Were Garnett and Pierce in 2008 in the same league as Bird and McHale in their prime? Should Chris Paul have been the MVP this past season? Were the Rockets really the most balanced of the good teams last year, and will the addition of Ron Artest make them that much more indomitable? Please feel free to leave a comment and take part in the now-customary brief survey below. Next week, I’ll go into much more detail on a new way to describe players, without having to use all those pesky words.

PostScript: Discussion of the “50-Win Standard”
I hope you took the time to read Josh Tucker’s excellent discussion on the established precedent of giving the MVP award only to players on teams with fifty or more wins. I have a few thoughts on how the 50-win minimum precedent fits in with the BoxScores methodology I’ve established here.

The first is that I essentially agree with the implied criteria of such a cutoff (or the implementation of the “Bryant-Nash Rule”). That is, I think that value should determine the MVP, and value is measured in wins, not strictly in individual statistics.

However, as an Arbitrarian, I would tend to shy away from establishing an arbitrary (though precedented) line of demarcation between those who should and should not be under consideration. I think that if you put Wilt Chamberlain on a team with 11 kindergartners, and that team won 41 games, I’d want to consider Chamberlain for MVP. That is, there should be a sliding scale, in the sense that each of the Detroit Pistons individually are less valuable than a single LeBron James, though the Pistons collectively tend to do better than do the Cavaliers collectively.

This is built right into the estimation of BoxScores: A player contributes X% of the production for a team with Y wins, and so he is credited with X•Y of those wins. Fortunately, from the standpoint of the 50-win precedent, as team wins decrease, it gets harder and harder for any player to outproduce a player on a 50+ win team.

For example, imagine a season in which player A contributes 20% of the production for a 50-win team. (20% is on the low end for MVP-candidate PVC, and 50 wins is at the low end for a contender as well, so this is a conservative estimate for an MVP frontrunner.) Such a player accumulated 0.2*50 = 10 BXS. Player B, let’s say, is on a 41-win team. In order to be more valuable than A, B would have to be responsible for 10/41 = 24.4% of his team’s production, which is very high, indeed. In 07-08, only two players had more than a quarter of their team’s valuable contributions, and one of those was the BoxScores MVP, Chris Paul (whose team had 56 wins). LeBron James was the other, and despite contributing more than 2/7 of his team’s production, Cleveland’s win total of 45 reflected James’ second-most-valuable status.

In sum, even if we use BoxScores as our measure of value, it is highly unlikely (although not impossible) that the MVP will come from a sub-50-win team. The precedent will likely remain intact.