GetWeb

GetWeb is a Python package designed to simplify web scraping and HTTP requests. It provides utilities for fetching web pages, extracting content, and downloading files.

I actually made it because im to dumb for bs4 and requests
Note: This package was created to provide a simpler alternative to using libraries like BeautifulSoup and Requests directly. My Discord: Tamino1230

Features

Perform GET and POST requests with error handling.
Download files from URLs.
Extract text, links, and images from web pages.
Search for elements by tag, ID, or class using BeautifulSoup.
Fetch metadata, titles, and headings from web pages.
Retrieve HTTP headers from URLs.

Installation

Option 1: Manual Installation

Download or clone this repository.
Place the GetWeb folder in your project directory.
Ensure the GetWeb folder contains the getweb module.

Option 1.5: Insert Local Python Folder

To add the GetWeb package to your local Python environment, follow these steps:

Find the folder where Python is installed on your computer:
- Windows: Look for one of these paths:
  - C:\Users\<YourUsername>\AppData\Local\Programs\Python\Python<version>\Lib\site-packages
  - C:\Python<version>\Lib\site-packages / C:\Python<version>\Lib (if Python is installed directly in C:\Python)
- macOS/Linux: Check one of these paths:
  - /usr/local/lib/python<version>/site-packages
  - ~/.local/lib/python<version>/site-packages
Copy the GetWeb folder into the site-packages or Lib (only Windows) directory.
Verify the installation by running the following command in your Python environment:
```
import getweb
print("getweb package installed successfully!")
```

Option 2: Using `pip`

(Not yet available on PyPI. Use manual installation for now.)

Usage

Importing the Package

from getweb import getweb

Example: Fetching a Web Page

# Initialize the getweb object with a URL
web = getweb("https://example.com")

# Fetch the page content
web.fetch()

# Get the text content of the page
print(web.get_text())

# Get all links on the page
print(web.get_links())

# Get all image sources on the page
print(web.get_images())

Example: Searching for Elements

# Find all elements with a specific tag
print(web.find_by_tag("p"))

# Find an element by its ID
print(web.find_by_id("main-content"))

# Find all elements with a specific class
print(web.find_by_class("highlight"))

# Find elements by a specific attribute
print(web.find_by_attribute("data-role", "button"))

Example: Extracting Metadata and Headings

# Get all meta tags
print(web.get_meta_tags())

# Get the title of the page
print(web.get_title())

# Get all headings (h1 to h6)
print(web.get_headings())

# Get headings of a specific level (e.g., h2)
print(web.get_headings(level=2))

Example: Downloading a File

from getweb.response import download_file

# Download a file from a URL
success = download_file("https://example.com/file.zip", "file.zip")
if success:
    print("File downloaded successfully!")
else:
    print("Failed to download the file.")

Example: Fetching Headers

from getweb.response import get_headers

# Get headers from a URL
headers = get_headers("https://example.com")
if headers:
    print(headers)
else:
    print("Failed to fetch headers.")

Additional Utilities

The GetWeb package also includes utility functions for common web-related tasks:

Example: Validating a URL

from getweb.utils import is_valid_url

# Check if a URL is valid
url = "https://example.com"
if is_valid_url(url):
    print(f"{url} is valid!")
else:
    print(f"{url} is not valid.")

Example: Prettifying HTML

from getweb.utils import prettify_html

# Prettify raw HTML content
raw_html = "<html><body><h1>Title</h1></body></html>"
pretty_html = prettify_html(raw_html)
print(pretty_html)

Example: Extracting Emails

from GetWeb.getweb.utils import extract_emails

# Extract email addresses from text
text = "Contact us at support@example.com or sales@example.org." # text could be the HTML
emails = extract_emails(text)
print(emails)

Example: Extracting Phone Numbers

from getweb.utils import extract_phone_numbers

# Extract phone numbers from text
text = "Call us at +1-800-555-1234 or (123) 456-7890." # text could be the HTML
phone_numbers = extract_phone_numbers(text)
print(phone_numbers)

Example: Getting the Base URL

from getweb.utils import get_base_url

# Extract the base URL from a full URL
full_url = "https://example.com/path/to/resource"
base_url = get_base_url(full_url)
print(base_url)

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
getweb		getweb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GetWeb

Features

Installation

Option 1: Manual Installation

Option 1.5: Insert Local Python Folder

Option 2: Using `pip`

Usage

Importing the Package

Example: Fetching a Web Page

Example: Searching for Elements

Example: Extracting Metadata and Headings

Example: Downloading a File

Example: Fetching Headers

Additional Utilities

Example: Validating a URL

Example: Prettifying HTML

Example: Extracting Emails

Example: Extracting Phone Numbers

Example: Getting the Base URL

About

Releases

Packages

Languages

License

Tamino1230/Package_getweb

Folders and files

Latest commit

History

Repository files navigation

GetWeb

Features

Installation

Option 1: Manual Installation

Option 1.5: Insert Local Python Folder

Option 2: Using pip

Usage

Importing the Package

Example: Fetching a Web Page

Example: Searching for Elements

Example: Extracting Metadata and Headings

Example: Downloading a File

Example: Fetching Headers

Additional Utilities

Example: Validating a URL

Example: Prettifying HTML

Example: Extracting Emails

Example: Extracting Phone Numbers

Example: Getting the Base URL

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Option 2: Using `pip`

Packages