From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on gnuweeb.org X-Spam-Level: X-Spam-Status: No, score=-1.2 required=5.0 tests=ALL_TRUSTED,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gnuweeb.org; s=default; t=1683117432; bh=/ANVW+I17umP3oeR4XAUDm3kL32mZmRKUHbXhtxI2Ic=; h=Date:From:To:Cc:Subject; b=oBYgKtCTYYGqbYecDwulbPt4igzkrZ4c6ZngxMHsr8xIX9wCoJhIMgyyNiwXXRtpp sQMjFf2zdR2LugEreZEwB9XvPpfSFWC/3XkME2ZH+vqriMR9XWDSeZRncjqlj0lJ5C USUvbdTGOKGuar2X9ZW8S+jNjkayNSpITqFxvb0Oh26MsSIwbW9XTlKDyFUOuctwLW kVn+tsbK7QvoTuJdSSO4F7HrxGHxH/I7v/M50shwbTWbc/sHNS/+AJL2XZnDayeAev Y183llQCduMVDwvix/fWAR3/ezvmyQ/695XI1LInDaL3py9Wzlu2/BHS8h02MX4GZd VG5KG+weBsNJw== Received: from biznet-home.integral.gnuweeb.org (unknown [182.253.183.197]) by gnuweeb.org (Postfix) with ESMTPSA id 0118D2451EC; Wed, 3 May 2023 19:37:10 +0700 (WIB) Date: Wed, 3 May 2023 19:37:07 +0700 From: Ammar Faizi To: GNU/Weeb Mailing List Cc: GNU/Weeb FB Team , Michael William Jonathan Subject: Introducing Facebook Scraper API (with Tor network support) Message-ID: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline X-Bpl: hUx9VaHkTWcLO7S8CQCslj6OzqBx2hfLChRz45nPESx5VSB/xuJQVOKOB1zSXE3yc9ntP27bV1M1 List-Id: We are open-sourcing a new project: Facebook Scraper API with Tor network support. It's fully written in PHP (yeah PHP, feel free to argue PHP is dead: it's not!). Currently, it can only scrape text and photo posts. I'll be adding the video support in the near future. Many features will come soon. This project is licensed under the GPLv2 license, which is my default open-source license choice. Comments and patches, welcome... Patches and any inquiry about this project should be directed to: To: Ammar Faizi Cc: Michael William Jonathan Cc: GNU/Weeb Mailing List Cc: GNU/Weeb FB Team The following changes since commit 75065abdba76e40f102041b70c2edaf4bf902259: fb: Initial commit (2023-05-01 00:48:36 +0700) are available in the Git repository at: https://gitlab.torproject.org/ammarfaizi2/Facebook.git master for you to fetch changes up to 0d5e59e00359e165778a81f80122bb522f8edb0f: Merge branch 'rewrite_url' (Facebook Onion rewrite support) (2023-05-03 18:46:47 +0700) ---------------------------------------------------------------- Ammar Faizi (33): fb: Create the initial 'Post' trait (getTimelineYears) fb: Create user cache mechanism fb: Post: Handle a 'get timeline years' edge case fb: Post: Create getTimelinePosts method fb: Post: Make getTimelineYears() more reliable fb: Post: Implement getPost() function fb: web: Create initial web API fb: Post: Handle not found in getTimelineYears() fb: Post: Fix stupid indentation fb: web: Add getPost() endpoint fb: helpers: Replace '

' with double lines instead of single line fb: web: Create 'logs' directory for web server logs fb: composer.json: Remove phpunit from require-dev fb: Use CURLPROXY_SOCKS5_HOSTNAME as proxy type fb: helpers: Trim the end result of full_html_clean() fb: Post: Split parsing logic in getPost() fb: Post: Split info parser fb: Post: Implement tryParsePhotoPost() fb: Post: Introduce `$take_content` argument in `getTimelinePosts()` fb: Post: Introduce `$limit` argument in `getTimelinePosts()` fb: web: Invert the getTimelinePosts() condition fb: web: Integrate `$take_content` and `$limit` args fb: Post: Switch `content` and `info` key position fb: Post: Parse the embedded link in a post fb: web: Create `httpGet()` API for visiting FB onion endpoints fb: Post: Introduce rewrite URL callback fb: web: Provide a proxy to access onion endpoints fb: Post: Call cleanURL() on the img_preview URL fb: web: Fix `is_compressed` value fb: web: Supress gzinflate error fb: web: Don't rewrite non Facebook onion URL Merge branch 'post' (initial FB post scraper API) Merge branch 'rewrite_url' (Facebook Onion rewrite support) auth.example.php | 2 + composer.json | 3 - main.php | 45 +++++ src/Facebook/Facebook.php | 93 ++++++++- src/Facebook/Methods/Post.php | 422 ++++++++++++++++++++++++++++++++++++++++ src/Facebook/helpers.php | 54 +++++ web/.gitignore | 1 + web/logs/.gitignore | 2 + web/public/api.php | 268 +++++++++++++++++++++++++ 9 files changed, 885 insertions(+), 5 deletions(-) create mode 100644 src/Facebook/Methods/Post.php create mode 100644 web/.gitignore create mode 100644 web/logs/.gitignore create mode 100644 web/public/api.php -- Ammar Faizi