Insights

Engineering notes, architectural deep-dives, and practical playbooks from the Devforth team.

Latest

Self-hosted GPT: real response time, token throughput, and cost on L4, L40S and H100 for GPT-OSS-20B
AI Infrastructure

Self-hosted GPT: real response time, token throughput, and cost on L4, L40S and H100 for GPT-OSS-20B

We benchmarked modern open-source LLMs across several popular GPUs to measure real-world context limits, throughput, latency, and cost efficiency under varying levels of concurrency — as close as possible to real production conditions. Here we share the results.
LLM Terminology Guide: Weights, Inference, Effective sequence length, and Self-Hosting Explained
AI Infrastructure

LLM Terminology Guide: Weights, Inference, Effective sequence length, and Self-Hosting Explained

A clear guide to generative AI and LLM terminology. Learn how model weights, quantization, inference, context length, batching, sampling and many more — including how to evaluate vendor APIs and self-host models like GPT-OSS-20B.
Unlimited CI build minutes without SaaS lock-In: A practical look at Woodpecker CI
DevOps & Reliability

Unlimited CI build minutes without SaaS lock-In: A practical look at Woodpecker CI

GitHub Actions pricing and policy changes have pushed teams to rethink CI ownership. This article explores Woodpecker CI as a self-hosted, open-source alternative that enables unlimited build minutes, predictable costs, and full infrastructure control.
OnLogs – open-source simplified web logs viewer for Dockers
OpenSource Contributions

OnLogs – open-source simplified web logs viewer for Dockers

Open-Source, simple, lightweight tiny web viewer for logs generated by Docker container. Use it for debugging your applications with simple logs access
ToBeDo - Simple Telegram checklist (ToDo) bot
OpenSource Contributions

ToBeDo - Simple Telegram checklist (ToDo) bot

ToBeDo is an open-source checklist bot created by Devforth. It transforms messages into a list with checkboxes. You can create checklists in direct messages with a bot or invite the bot into a group or channel as an admin.
GPT-J is a self-hosted open-source analog of GPT-3: how to run in Docker
AI Infrastructure

GPT-J is a self-hosted open-source analog of GPT-3: how to run in Docker

Learn how to setup open-source GPT-J model on custom cheapest servers with GPU. Try to run the text generation AI model of the future and talk to it right now!
URL monitoring made easy: self-hosted open-source tool for checking your website availability
DevOps & Reliability

URL monitoring made easy: self-hosted open-source tool for checking your website availability

You don’t need a complex stack to monitor availability. A lightweight, self-hosted approach to website uptime monitoring that scales with your needs
Nginx vs Traefik: how slower one can be better?
Architecture & Systems

Nginx vs Traefik: how slower one can be better?

Comprehensive review for popular HTTP routing proxy servers. Learn about weak and strong points of both and understand how to select one based on your project goals.
How to provision an OpenVPN server on Ubuntu for development and testing
DevOps & Reliability

How to provision an OpenVPN server on Ubuntu for development and testing

A step-by-step guide to deploying a self-hosted VPN for development, testing, and internal infrastructure access.