Page 207 - A Study on the Role of UGC Platforms in Copyright Law:An Intermediary-oriented Approach
P. 207

A Study on the Role of UGC Platforms in Copyright Law:   Chapter 7 Platform Users’ Entitlement to UGCs: Human Use and Web Scraping
 An Intermediary-oriented Approach





                  Chapter 7 Platform Users’ Entitlement to

                    UGCs: Human Use and Web Scraping




                 7.1 Introduction

                    Chapter 6 proposed some criteria to ensure the fairness of UGC platforms’ exploitation
                 of UGCs. Since UGCs are not only exploited by UGC platforms but also by platform
                 users, the current chapter explores the criteria for UGC platform users’ use of UGCs,
                 to strike a balance between the interests of UGC creators, UGC platforms and platform
                 users in exploiting UGCs. Under copyright law, any copying, distributing or making of a
                 derivative work of a copyrighted UGC without the consent of the UGC creator is a copyright
                 infringement unless it falls under the umbrella of fair use/fair dealing or other privileges. The
                 use of UGCs enjoys more safety valves than works that are not on UGC platforms because
                 the ToUs/ToSs of UGC platforms usually allow platform users to access and use UGCs
                               1
                 to some extent.  Nevertheless, ToUs/ToSs only cover certain types of use, usually non-
                 commercial use by human users. Section 7.2 proposes some guidelines to govern the use that
                 is not addressed by ToUs/ToSs or statutory privileges. 2
                    In addition to the difference between non-commercial use and commercial-use by human
                 users, the difference between human use and use by web scraping tools is important. Web
                 scraping tools are ‘web crawlers, bots, scraping tools, data mining tools, bulk downloading
                 tools, wget utility or similar data gathering and extraction tools’.  The process of gathering
                                                                         3
                 and extracting data from websites is called web scraping, web harvesting, screen scraping,
                 web crawling or web data extraction.  Compared with human browsing, web scraping is
                                                  4
                                     5
                 characterised by speed.  A human can only access one page at a time. In contrast, because
                 web scraping is automatically driven by machines, it can automatically extract an enormous
                 number of UGCs from many pages a thousand times faster than humans, with almost zero





                 1   Section 7.2.1.
                 2   The noncommercial UGC access levy scheme in Chapter 3 and the noncommercial UGC creation levy scheme in Chapter 4
                    are collectively named noncommercial UGCs levy schemes hereafter.
                 3   Terms of Service in Quora (23 October 2018), art 4d <https://www.quora.com/about/tos> accessed 19 May 2019; ToUs/ToSs
                    in AsianFanFics, YouTube, Dianping.com, IMDb, Craigslist, SoundCloud.
                 4   Alan Ross Mach. Corp. v. Machinio Corp., 2018 WL 3344364, at *5 (N.D. Ill. July 9, 2018).
                 5   Velotio Technologies, ‘Web Scraping: Introduction, Best Practices & Caveats’ (Medium, 11February 2019) <https://medium.
                    com/velotio-perspectives/web-scraping-introduction-best-practices-caveats-9cbf4acc8d0f> accessed 10 June 2019.



                                                                                          • 193 •
   202   203   204   205   206   207   208   209   210   211   212