Hey all you cool cats and kittens, Joe Exotic here to take you on a wild ride in the world of (basic) statistical analysis of some football stats and the NSFL. First off, who am I? I am Gamaliel, my player is Joe Exotic – DT. I recently created and am eligible? for drafting in the DSFL. I found out about the NSFL from the PBE and am fairly active in that league. Having just joined the NSFL, I know literally nothing about the teams, players, or the league in general. So what is the best way to learn about a league like this? Well the best way to take a deep dive into the stats from the games. I want to emphasis again that I haven’t read any past articles about this and might be rehashing some already tired arguments, or I might be completely wrong because again, I know nothing about this league.
I started this analysis with the question. What in this league predicts the success for a defensive lineman? My working “definition” for success in this will be counting stats, those are nice and quantifiable and easily tracked from the Index. And what do defensive lineman do? They tackle people. So I’m going to start this with the question. What attribute predicts the amount of tackles for the NSFL Defensive Lineman?
For starters, I pulled in information from the Index for this year’s individual defensive stats. Sidenote – If anybody has a convenient way of pulling historical data over a number of seasons I would love to get my hands on that because currently it’s a bit of small sample size zone. Then I joined this data to TPE tracker data which shows TPE totals and where the TPE has been applied to, very handy for looking at the impacts of individual attributes. I then cleaned this data up, removing any free agents and any players who haven’t been seen in 2020. I ended up with 32 players that accumulated stats in this most recent season of the NSFL. Let me know if I missed anyone in this. I’m pretty happy with how clean the data was. Also, I am using a program called Alteryx for all of this data cleansing and analysis. You see this data in the picture below:
![[Image: XFF6KyL.png]](https://i.imgur.com/XFF6KyL.png)
Alright, time to run this data through a linear regression. First off, how does current TPE impact tackles?
![[Image: bb4I7nR.png]](https://i.imgur.com/bb4I7nR.png)
We can see a fairly decent relationship (for “real” data) with about 22% of tackles explained by TPE totals. Let’s look at some attributes that can also effect tackles, namely, Tackle, Speed, and Strength
You would think that Tackle would predict tackles right?
![[Image: nCl7zhd.png]](https://i.imgur.com/nCl7zhd.png)
Not quite, an even weaker relationship than just total TPE. What about Speed?
![[Image: slnQawK.png]](https://i.imgur.com/slnQawK.png)
Oof, no need to put speed in based on this linear regression. Time for Strength.
![[Image: fSUe27m.png]](https://i.imgur.com/fSUe27m.png)
Again, not a great relationship. Time to try a multiple regression with all 4 of these variables.
![[Image: ijhwKTJ.png]](https://i.imgur.com/ijhwKTJ.png)
Hey! This regression is decent, but not great. This thought led me to the thought that maybe it’s not really a great idea to be looking just at tackles for the defensive linemen because most tackles don’t happen from a linemen. They are more likely to have a tackle for a loss or a sack, and most team’s percentage of tackles are going to be coming from either their secondary or linebackers who are covering the receivers and running backs. So to help my investigation out, I took the team total defensive stats and compared them to the DE and DT summed totals that I was already looking and found which percentages of Tackles, Tackles for losses, and Sacks came from the defensive linemen for each team and took the average of that. Check out the results below:
![[Image: t3IDSrS.png]](https://i.imgur.com/t3IDSrS.png)
So based on these results, on average only 12% of a team’s tackles are coming from their D Lineman, but almost 40% of sacks and tackles for losses came from the DLine! It seems that those two stats would be better at taking a look at. For brevity’s sake, I am just going to highlight a few more of these that illustrate some of the better R-squared values. First off, Sacks:
![[Image: 56UDG2h.png]](https://i.imgur.com/56UDG2h.png)
There wasn’t anything that was a great predictor of sacks individually, but all together they made a decent estimate. I wonder if this was because of the role of team strategies, Quarterback elusiveness, and just plain luck. Any thoughts from anyone on that? Next up, Tackle for a loss.
![[Image: cKM9ii1.png]](https://i.imgur.com/cKM9ii1.png)
It turned out that the best predictor for tackles for a loss was speed. Which logically makes sense. Without speed how can you get behind the offensive linemen and tackle behind the line of scrimmage? Again, I think there is a lot of small sample size baked into these results, but it is at least interesting to see which factors can play into these stats. Alright all, roast me on why my method is flawed!
I started this analysis with the question. What in this league predicts the success for a defensive lineman? My working “definition” for success in this will be counting stats, those are nice and quantifiable and easily tracked from the Index. And what do defensive lineman do? They tackle people. So I’m going to start this with the question. What attribute predicts the amount of tackles for the NSFL Defensive Lineman?
For starters, I pulled in information from the Index for this year’s individual defensive stats. Sidenote – If anybody has a convenient way of pulling historical data over a number of seasons I would love to get my hands on that because currently it’s a bit of small sample size zone. Then I joined this data to TPE tracker data which shows TPE totals and where the TPE has been applied to, very handy for looking at the impacts of individual attributes. I then cleaned this data up, removing any free agents and any players who haven’t been seen in 2020. I ended up with 32 players that accumulated stats in this most recent season of the NSFL. Let me know if I missed anyone in this. I’m pretty happy with how clean the data was. Also, I am using a program called Alteryx for all of this data cleansing and analysis. You see this data in the picture below:
![[Image: XFF6KyL.png]](https://i.imgur.com/XFF6KyL.png)
Alright, time to run this data through a linear regression. First off, how does current TPE impact tackles?
![[Image: bb4I7nR.png]](https://i.imgur.com/bb4I7nR.png)
We can see a fairly decent relationship (for “real” data) with about 22% of tackles explained by TPE totals. Let’s look at some attributes that can also effect tackles, namely, Tackle, Speed, and Strength
You would think that Tackle would predict tackles right?
![[Image: nCl7zhd.png]](https://i.imgur.com/nCl7zhd.png)
Not quite, an even weaker relationship than just total TPE. What about Speed?
![[Image: slnQawK.png]](https://i.imgur.com/slnQawK.png)
Oof, no need to put speed in based on this linear regression. Time for Strength.
![[Image: fSUe27m.png]](https://i.imgur.com/fSUe27m.png)
Again, not a great relationship. Time to try a multiple regression with all 4 of these variables.
![[Image: ijhwKTJ.png]](https://i.imgur.com/ijhwKTJ.png)
Hey! This regression is decent, but not great. This thought led me to the thought that maybe it’s not really a great idea to be looking just at tackles for the defensive linemen because most tackles don’t happen from a linemen. They are more likely to have a tackle for a loss or a sack, and most team’s percentage of tackles are going to be coming from either their secondary or linebackers who are covering the receivers and running backs. So to help my investigation out, I took the team total defensive stats and compared them to the DE and DT summed totals that I was already looking and found which percentages of Tackles, Tackles for losses, and Sacks came from the defensive linemen for each team and took the average of that. Check out the results below:
![[Image: t3IDSrS.png]](https://i.imgur.com/t3IDSrS.png)
So based on these results, on average only 12% of a team’s tackles are coming from their D Lineman, but almost 40% of sacks and tackles for losses came from the DLine! It seems that those two stats would be better at taking a look at. For brevity’s sake, I am just going to highlight a few more of these that illustrate some of the better R-squared values. First off, Sacks:
![[Image: 56UDG2h.png]](https://i.imgur.com/56UDG2h.png)
There wasn’t anything that was a great predictor of sacks individually, but all together they made a decent estimate. I wonder if this was because of the role of team strategies, Quarterback elusiveness, and just plain luck. Any thoughts from anyone on that? Next up, Tackle for a loss.
![[Image: cKM9ii1.png]](https://i.imgur.com/cKM9ii1.png)
It turned out that the best predictor for tackles for a loss was speed. Which logically makes sense. Without speed how can you get behind the offensive linemen and tackle behind the line of scrimmage? Again, I think there is a lot of small sample size baked into these results, but it is at least interesting to see which factors can play into these stats. Alright all, roast me on why my method is flawed!