All posts tagged: Ollama

I quit ChatGPT for a free, private, and local AI called Ollama – here’s why

Jack Wallen/ZDNET

ZDNET’s key takeaways
Ollama is a user-friendly, locally installed AI.
You will be surprised at the benefits you gain from this app.
Not only is Ollama free and private, but it’s also open-source.

Ollama is a small player in the AI game, but it should be much bigger than it is. This installable AI has several benefits you won’t find with the likes of ChatGPT, and those benefits are what keep me from ever using a more traditional service. I’ve written about Ollama quite often over the past year, but I thought it was time to share the reasons I choose to use this app over any other.

(Disclosure: Ziff Davis, ZDNET’s parent company, filed an April 2025 lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.)

What is Ollama?

Before I get into the why, let’s talk about …

Ollama Now Runs Faster on Macs Thanks to Apple’s MLX Framework

Ollama, the popular app for running AI models locally on a computer, has released an update that takes advantage of Apple’s own machine learning framework, MLX. The result is a hefty speed boost on Macs with Apple silicon. According to Ollama, the new version processes prompts around 1.6 times faster (prefill speed) and nearly doubles the speed at which it generates responses (decode speed). Macs with M5-series chips are said to see the largest improvements, thanks to Apple’s new GPU Neural Accelerators. The update also includes smarter memory management, which should make AI-powered coding tools and chat assistants feel noticeably more responsive during extended use. Ollama says the new performance boost should especially benefit macOS users who run personal assistants like OpenClaw or coding agents like Claude Code, OpenCode, or Codex. The preview release is available to download as Ollama 0.19 – just make sure you have a Mac with more than 32GB of unified memory to run it. Support is currently limited to Alibaba’s Qwen3.5, but Ollama says support for more AI models is …

AnythingLLM Self-Hosted AI Workspace Replaces Ollama & LangChain Tools

AnythingLLM, demonstrated by Better Stack, offers a single self-hosted platform that consolidates the capabilities of Ollama, LangChain and custom UIs into a unified environment. Designed for developers working with large language models (LLMs), it supports tasks like document processing, codebase interaction and retrieval-augmented generation (RAG). With features such as a drag-and-drop interface, a visual workflow builder and compatibility with multiple model providers, it emphasizes privacy and flexibility while simplifying complex workflows. However, its high resource requirements and occasional workflow adjustments may present challenges for certain use cases.

In this overview, you’ll explore how AnythingLLM enables streamlined RAG integration, supports isolated workspaces for managing multiple projects and allows dynamic model switching mid-conversation. You’ll also gain insights into its practical applications, from building private AI systems to developing client-facing solutions, as well as its limitations, such as hardware constraints. By the end, you’ll have a clear understanding of how this platform fits into modern AI development workflows.

Streamlined AI Workflow Tool

TL;DR Key Takeaways:
Integrated Platform: AnythingLLM combines tools like Ollama, LangChain, vector databases and …