blog.dev.oxen.ai Open in urlscan Pro
76.76.21.123  Public Scan

URL: https://blog.dev.oxen.ai/
Submission: On February 03 via api from US — Scanned from US

Form analysis 0 forms found in the DOM

Text Content

 * Datasets
 * Community
 * Docs
 * FAQ
 * Blog
 * Pricing
 * Log in
 * Get Started


Datasets
Docs
Blog
Pricing
Search



LOG INGET STARTED


CURATE QUALITY DATASETS WITH CONFIDENCE

ImageAudioVideoTabularText

Whether it’s Image Recognition, Text Classification, or Generative AI, use
Oxen’s version control to enable powerful data workflows.

$ brew tap Oxen-AI/oxen
$ brew install oxen
$ oxen clone https://hub.oxen.ai/ox/CatDogBBox
🐂 Downloading 100.2 MB
🎉 Cloned to CatDogBBox/

Replay
OXEN DATASETS


NO MODEL WILL SAVE YOU FROM BAD DATA

Curating data is hard. We help you become a master Data Mixologist.

EXPLORE DATASETS

OXEN HUB


ADD TRANSPARENCY TO YOUR AI DATASETS

The ability to conceptualize, visualize, and see the actual data that trained a
model is critical

GET STARTED


TRACK CHANGES


DATASETS CHANGE OVER TIME

Datasets should be living, breathing assets that change over time. Oxen makes it
easy to compare changes in your data, so that you can have confidence moving
forward.

HOW IT WORKS



ENHANCE YOUR WORKFLOW

Use Oxen’s CLI & Oxen Hub to manage any and all datasets.

AUDIT

Oxen provides access controls and a visible change history to make
reproducibility, compliance, and privacy easier.

DISCOVER

Oxen provides a single source where you can search your organization’s machine
learning datasets as well as the robust #OpenData aggregated by the community.

EXPERIMENT

Oxen makes experimentation with datasets as easy as branch switching. Push and
pull data with the force of Oxen — up to 1000x faster than other solutions out
there.

MANAGE

Oxen simplifies how you interact with datasets, with familiar (“git-like”)
command line tools, a user-friendly web interface, and standardized data
parameters.

COLLABORATE

Oxen creates an easy to visualize and interact with history of dataset changes
allowing you to more easily experiment and share.

OPEN SOURCE

Oxen provides access controls and a visible change history to make
reproducibility, compliance, and privacy easier.


FROM THE OXEN HERD

DATA VERSION CONTROL 101 WITH OXEN

This intro tutorial from Oxen.ai shows how Oxen can make versioning your data as
easy as versioning your code. Oxen is built to track and store changes for
everything from a single CSV to data repositories with millions of unstructured
images, videos, audio or text files. The tutorial will go through what data
version control is, why it is important, and how Oxen helps data scientists and
engineers gain visibility and confidence when sharing data with the rest of
their team. Here's a video ve...

Greg Schoeninger
Nov 9, 2023

ARXIV DIVE MANIFESTO

Every Friday the team at Oxen.ai gets together and goes over research papers,
blog posts, or books that help us stay up to date with the latest in Machine
Learning and AI. We call it Arxiv Dives because https://arxiv.org/ is a great
resource for the latest research in the field. In September of 2023, we decided
to make it public so that anyone can join. We’ve had amazing minds from hundreds
of companies like Amazon, DoorDash, Meta, Google, and Tesla join the
conversation, but I thought it would...

Greg Schoeninger
Nov 5, 2023

HOW TO RUN LLAMA-2 ON CPU AFTER FINE-TUNING WITH LORA

Running Large Language Models (LLMs) on the edge is a fascinating area of
research, and opens up many use cases that require data privacy or lower cost
profiles. With libraries like ggml coming on to the scene, it is now possible to
get models anywhere from 1 billion to 13 billion parameters to run locally on a
laptop with relatively low latency. In this tutorial, we are going to walk step
by step how to fine tune Llama-2 with LoRA, export it to ggml, and run it on the
edge on a CPU. We assume...

Greg Schoeninger
Oct 23, 2023
Copyright © 2024 Oxen.ai, All Rights Reserved
FAQPrivacy PolicyTerms and Conditions