`

Python利用BeautifulSoup抓取解析网页数据

阅读更多
网页UI以及HTML组织形式,目的是抓取网页数据并解析。
 <div class="clan__table">
      <div class="clan__headers">
        <div class="clan__headerCaption">Rank</div>
        <div class="clan__headerCaption">Name</div>
        <div class="clan__headerCaption">Level</div>
        <div class="clan__headerCaption">League</div>
        <div class="clan__headerCaption">Trophies</div>
        <div class="clan__headerCaption">Donations</div>
        <div class="clan__headerCaption">Role</div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #1
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2P0V2CCY">北斗</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__2"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4438</div>
        </div>
        <div class="clan__row">379</div>
        <div class="clan__row">
             Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #2
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/9UURJRQU">wglj</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">12</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__2"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4344</div>
        </div>
        <div class="clan__row">498</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #3
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2LQYLYJQ">大彪客</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4276</div>
        </div>
        <div class="clan__row">322</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #4
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/9PLUY020">冰城好警</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">12</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4229</div>
        </div>
        <div class="clan__row">264</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #5
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2L9UPJP0">你可以的</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">12</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4220</div>
        </div>
        <div class="clan__row">380</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #6
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/9U8QLY9V">cavan_陈</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4205</div>
        </div>
        <div class="clan__row">204</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #7
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/YUJYLJJ">小新</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4171</div>
        </div>
        <div class="clan__row">308</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #8
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/PL9VU80U">指挥若定</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4154</div>
        </div>
        <div class="clan__row">186</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #9
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/8RLQJQRR">hwj</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4023</div>
        </div>
        <div class="clan__row">48</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #10
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/QQUV9RR9">路飞</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__1"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">4000</div>
        </div>
        <div class="clan__row">202</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #11
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/20RRU0LJ2">奥格瑞玛</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3893</div>
        </div>
        <div class="clan__row">470</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #12
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/LCP9CL8C">Subway</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3882</div>
        </div>
        <div class="clan__row">72</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #13
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/8Q2J8LVP">会飞の鱼</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3873</div>
        </div>
        <div class="clan__row">199</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #14
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/QY2Y9R0J">柔柔bb</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3862</div>
        </div>
        <div class="clan__row">178</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #15
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/U2R2LGR2">北斗</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3812</div>
        </div>
        <div class="clan__row">38</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #16
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/QGVYJCYG">Paul</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3769</div>
        </div>
        <div class="clan__row">42</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #17
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/92Q29R0U">天黑</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3692</div>
        </div>
        <div class="clan__row">100</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #18
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/92QLQPUU">Addison Liu</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3656</div>
        </div>
        <div class="clan__row">240</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #19
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/YQRPCJ9P">系统奖励</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3615</div>
        </div>
        <div class="clan__row">376</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #20
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/JUGJ88GP">二牛和小姨子</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3550</div>
        </div>
        <div class="clan__row">384</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #21
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/80GUU0U2">阿牛someone</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3514</div>
        </div>
        <div class="clan__row">74</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #22
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/GPGLJ0C">森林大地</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3480</div>
        </div>
        <div class="clan__row">188</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #23
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2L29R2V8">george</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3459</div>
        </div>
        <div class="clan__row">166</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #24
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/UPPGLURG">Big Bang is VIP</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3380</div>
        </div>
        <div class="clan__row">92</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #25
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/22GUQYV8Y">佐为</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3305</div>
        </div>
        <div class="clan__row">43</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #26
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2PLQQLPC">X-HUNTER</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3264</div>
        </div>
        <div class="clan__row">168</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #27
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/UG8JCJ8C">拾蛤蜊</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3253</div>
        </div>
        <div class="clan__row">300</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #28
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/229J9U2GR">RT</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3230</div>
        </div>
        <div class="clan__row">108</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #29
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/U22L0GGQ">忍者老宝</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">11</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3200</div>
        </div>
        <div class="clan__row">64</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #30
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/CCGLGGJC">RT</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3196</div>
        </div>
        <div class="clan__row">244</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #31
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/8YL9LJLC">林GHL</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3147</div>
        </div>
        <div class="clan__row">341</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #32
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/Y8CJQGVR">一生註定是學渣</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3030</div>
        </div>
        <div class="clan__row">0</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #33
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2229LG9PG">万小宝</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">3014</div>
        </div>
        <div class="clan__row">10</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #34
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/8PQRCULJ">sakura怡</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2846</div>
        </div>
        <div class="clan__row">32</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #35
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/9YC2LUPR">King丶梦屹</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2841</div>
        </div>
        <div class="clan__row">286</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #36
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/V8QQ8V29">骚铭之父</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2811</div>
        </div>
        <div class="clan__row">20</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #37
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/88Y9RPLP2">1L领域</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2806</div>
        </div>
        <div class="clan__row">158</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #38
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/28JR0VPP9">СаНёК</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2785</div>
        </div>
        <div class="clan__row">60</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #39
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/28R0J0JR2">aqil</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2753</div>
        </div>
        <div class="clan__row">34</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #40
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/CQ0VPJJL">#Pepapig</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">8</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2744</div>
        </div>
        <div class="clan__row">20</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #41
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/YCGVRJ02">mohinka</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2656</div>
        </div>
        <div class="clan__row">108</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #42
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2JRCC2LY9">Ben先生</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">8</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2655</div>
        </div>
        <div class="clan__row">116</div>
        <div class="clan__row">
             Co-Leader
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #43
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/L8GR8Q9J">寂寞真离骚l</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">8</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2625</div>
        </div>
        <div class="clan__row">30</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #44
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/VRRY8PUC">路海战队</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2623</div>
        </div>
        <div class="clan__row">0</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #45
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/PVQY29QL">Supergirl</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2544</div>
        </div>
        <div class="clan__row">0</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #46
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/G0GQQ809">三刀索隆</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2542</div>
        </div>
        <div class="clan__row">0</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #47
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/8JGY8R0C">York royale</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2472</div>
        </div>
        <div class="clan__row">0</div>
        <div class="clan__row">
             Elder
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #48
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2QR080LCV">famidore</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">10</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2443</div>
        </div>
        <div class="clan__row">88</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #49
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/2VVY9VG"># vj</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2371</div>
        </div>
        <div class="clan__row">0</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

      
      <div class="clan__rowContainer">
        <div class="clan__row">
                            #50
                    </div>
        <div class="clan__row">
          <a class="ui__blueLink" href="/profile/JVCU0Q0V">Forever丿old8</a>
        </div>
        <div class="clan__row">
          <span class="clan__playerLevel">9</span>
        </div>
        <div class="clan__row">
          <div class="clan__leagueContainer">
                            <div class="league__0"></div>
                        </div>
        </div>
        <div class="clan__row">
          <div class="clan__cup">2349</div>
        </div>
        <div class="clan__row">40</div>
        <div class="clan__row">
             Member
                    </div>
      </div>

          </div>

 

 通过查看页面源代码,我们发现每一个玩家信息都是存储在一个class为clan__rowContainer的div中。

那么我们就可以通过soup的finaAll选择器来获取所有行的玩家信息,然后遍历挨个解析玩家数据。

具体代码:

for i, row in enumerate(soup.findAll("div",attrs = {"class":"clan__rowContainer"})):
        user_dict = {}
        for j,col in enumerate(row.findAll("div",attrs = {"class":"clan__row"})):
            if j == 0:
                user_dict["rank"] = col.string.strip().replace("#","")
            elif j == 1:
                user_dict["name"] = col.a.string.strip()
                user_dict["uid"] = col.a.get("href").strip("/profile/")
            elif j == 2:
                user_dict["level"] = col.span.string.strip()
            elif j == 3:
                user_dict["league"] = col.contents[1].div.get("class")[0].replace("league__","")
            elif j == 4:
                user_dict["score"] = col.div.string.strip()
            elif j == 5:
                user_dict["donations"] = col.string.strip()
            elif j == 6:
                user_dict["role"] = col.string.strip()
        print(user_dict)
        

   

 

  • 大小: 417 KB
  • 大小: 486.6 KB
分享到:
评论

相关推荐

    python基于BeautifulSoup实现抓取网页指定内容的方法

    主要介绍了python基于BeautifulSoup实现抓取网页指定内容的方法,涉及Python使用BeautifulSoup模块解析html网页的相关技巧,具有一定参考借鉴价值,需要的朋友可以参考下

    Python利用BeautifulSoup解析Html的方法示例

    BeautifulSoup是python的一个库,最主要的功能是从网页抓取数据。下面这篇文章主要给大家介绍了关于Python利用BeautifulSoup解析Html的方法示例,文中通过示例代码介绍的非常详细,需要的朋友们下面跟着小编来一起...

    Coursera_Capstone:该项目旨在使用聚类和Foursquare API对多伦多市内的社区进行细分和聚类。 此外,您还将学习如何使用Python包Beautifulsoup抓取网站并解析HTML代码,以及如何将数据转换为pandas数据框

    Coursera_Capstone 该项目旨在使用聚类和Foursquare API对多伦多市内的社区进行细分和聚类。 此外,您还将学习如何使用Python包Beautifulsoup抓取网站并解析HTML代码,以及如何将数据转换为pandas数据框。

    Python使用BeautifulSoup库解析HTML基本使用教程

    BeautifulSoup是Python的一个第三方库,可用于帮助解析html/XML等内容,以抓取特定的网页信息。目前最新的是v4版本,这里主要总结一下我使用的v3版本解析html的一些常用方法。 准备 1.Beautiful Soup安装 为了能够对...

    百度首页数据抓取:用Python快速入门网络爬虫

    在这个教程中,我们将以爬取百度首页为例,介绍如何使用Python语言进行简单的网页数据抓取。 本教程主要分为四个部分: 导入所需库:介绍了在爬虫程序中需要使用的Python库,包括requests库(用于向网站发送HTTP...

    Python HTML解析器BeautifulSoup用法实例详解【爬虫解析器】

    简单来说,BeautifulSoup最主要的功能是从网页抓取数据。本文我们来感受一下BeautifulSoup的优雅而强大的功能吧! BeautifulSoup安装 BeautifulSoup3 目前已经停止开发,推荐在现在的项目中使用BeautifulSoup4,不

    Python爬虫,轻松获取网络数据的利器!

    2. 解析网页:通过Python的BeautifulSoup库,我们可以轻松地解析HTML和XML文档,提取所需的数据。 3. 保存数据:可以将提取的数据保存到本地文件或数据库中,方便后续处理和分析。 此外,Python爬虫还具有高级功能,...

    Python实现抓取百度搜索结果页的网站标题信息

    比如,你想采集标题中包含“58同城”的SERP结果,并过滤包含有“北京”或“厦门”等结果数据。 该Python脚本主要是实现以上功能。 其中,使用BeautifulSoup来解析HTML,可以参考我的另外一篇文章:Windows8下安装...

    Python基础应用实例-爬取

    网络爬虫,又称为网页蜘蛛,是一种按照一定的规则,自动地抓取万维网信息的程序或者脚本。...利用Python编程语言及相关的爬虫技术,我们可以轻松地抓取这些音乐数据,并进行深入的分析,为音乐产业的发展提供有力支持。

    BeautifulSoup

    Beautiful Soup 是用 Python 写的一个 HTML/XML 的解析器,它可以很好的处理不规范标记并生成剖析树。通常用来分析爬虫抓取的web文档。 注意:为了解决乱码问题,用版本3的。如3.2.1。BeautifulSoup处理后的默认编码...

    爬虫+Python爬虫案例+Python技术+案例分析

    编写爬虫代码:使用Python的requests库发送HTTP请求,获取API返回的JSON数据,并利用BeautifulSoup库解析这些数据。 数据提取与存储:从解析后的数据中提取所需的信息,如新闻标题和链接,并将它们保存到本地文件中...

    Python基于BeautifulSoup爬取京东商品信息

    今天小编利用美丽的汤来为大家...如何利用BeautifulSoup抓取京东网商品信息 首先进入京东网,输入自己想要查询的商品,向服务器发送网页请求。在这里小编仍以关键词“狗粮”作为搜索对象,之后得到后面这一串网址: ...

    python爬虫讲解.docx

    解析网页是指使用Python的解析库对网页源代码进行解析,提取所需的数据;提取数据是指将解析后的数据进行处理,提取出需要的信息;存储数据是指将提取出的数据存储到本地或数据库中。 Python爬虫的实现需要用到一些...

    用Python来抓取一个新闻网站上的文章标题、链接、发布时间和内容

    在这个爬虫开发案例中,我们将使用Python的requests库和BeautifulSoup库来抓取一个新闻网站(如https://news.example.com)上的文章标题、链接、发布时间和内容。 代码解释 导入requests和BeautifulSoup库。 定义...

    Python爬虫算法-谷歌内部资料

    通常情况下,Python爬虫会利用第三方库如Requests、BeautifulSoup、Scrapy等来实现网页请求、数据解析和存储等功能。 Python爬虫的基本流程包括以下几个步骤: 1. 发起请求:使用Python发送HTTP请求到目标网站,...

    python爬虫讲解(1).docx

    解析网页是指使用Python的解析库对网页源代码进行解析,提取所需的数据;提取数据是指将解析后的数据进行处理,提取出需要的信息;存储数据是指将提取出的数据存储到本地或数据库中。 Python爬虫的实现需要用到一些...

    【Python爬虫】 基于 urllib 抓取指定贴吧的指定页的数据,保存到本地文件

    此资源是一个基于Python的爬虫脚本,利用urllib库抓取指定贴吧的指定页数据,并将抓取到的内容保存到本地文件中。该脚本可以帮助用户快速获取贴吧中的帖子标题、内容、发布时间等信息,并可以用于数据分析、内容提取...

    Python毕业设计-基于Python的二手房数据采集及可视化分析源码(高分期末大作业).zip

    2. 网页解析:利用Python的网络爬虫库(如Requests、BeautifulSoup等),解析目标网站的网页结构,获取二手房房产信息的网页元素。 3. 数据抓取:通过编写Python爬虫程序,按照设定的规则和算法从目标网站上抓取...

Global site tag (gtag.js) - Google Analytics