Break through blockchain complexity and get right to the data analysis.
TL;DR: Pinax datasets simplify blockchain data access and analysis by handling the technical complexities behind the scenes. Our datasets address key challenges like extraction complexity, heavy data volume, and tool incompatibility by providing pre-processed blockchain data in easily queryable formats. Users can access historical chain data from multiple blockchains through Snowflake Marketplace or S3 buckets, making blockchain data analysis accessible to developers and analysts without requiring specialized technical expertise.
Every time you type a query into Google, an intricate series of data processing steps unfold behind the scenes—yet we barely notice it happening.
We’re used to the technical aspects of accessing online data and information being removed from our experience.
But search engines like Google take numerous steps to produce our results. Your query goes to the search engine’s servers, which check the index. Then, the search engine ranks the results using search algorithms to produce the best, most relevant results. Finally, the results are personalized according to your location, language, and search history.
Pinax does something similar for blockchain datasets. We take care of the complex tasks in the background, so you get the blockchain data you need in a format that you can easily query and analyze.
Pinax datasets
Pinax datasets offer a simple way to access and apply blockchain data for a range of uses, from development to data analysis and verification. Our datasets remove the complexity so our customers can focus on their goals and what’s important to them.
What are the problems we’re solving?
In the video, Dillan lists several problems that blockchain data consumers can overcome with our datasets.
- Extraction complexity: Extracting blockchain data is hard. Doing so successfully requires technical expertise and tools that many people (even developers) do not have.
- Heavy data volume: Some blockchains have hundreds of gigabytes of transfers per month. Add that up over time, and you’re struggling with terabytes of data, which is challenging to manage.
- Storage demands: Expanding networks lead to significant storage needs and increased operational costs.
- Scalability burden: If you want to analyze data from multiple blockchains, you would have so much data that it would be operationally heavy to manage without robust infrastructure.
- Tool incompatibility: Many people, including data analysts, aren’t familiar with blockchain extraction tools.
What’s our solution?
- We have the technical expertise. We’ve been engineering blockchain analytics and processing tools since 2018, and we have deep experience operating Firehose and Substreams technology, so we can extract whatever data you need.
- We have the infrastructure. We operate our own robust infrastructure, managing bare metal hardware across multiple data centers. You can trust us to be reliable and fast.
- We make the data accessible. We transform the data into queryable tables and output it in Parquet files that are stored in S3 buckets. This format makes it easy for anyone to access, saving you time and effort learning to use complex tools.
- We make it easy to analyze the data. We provide multiple options for interacting with the data, depending on your preferred tools.
What makes our datasets stand out?
We provide datasets with full historical chain data, including blocks, transactions, logs, storage changes, and traces.
You can access data from multiple blockchains, like Ethereum, Base, Arbitrum One, BNB Chain, Polygon, Solana, and Antelope chains like WAX and EOS.
Our datasets are easy to access and query:
- We import Parquet files into our Snowflake managed database, and then you can interact with the dataset on Snowflake Marketplace.
- You interact with the Parquet files by querying our S3 endpoint to fetch the files you need and then use a query engine or database management system.
- You download the Parquet files locally and use Python data science libraries like pandas or Polars.
Datasets in action
Follow along with Dillan and learn more about an Ethereum dataset at 3:19 of the video.
Keep watching to see Pinax’s demo website and get an idea of what we’re working on and planning to offer.
Watch as he:
- Runs some sample queries in the Snowflake SQL playground.
- Uses AI to write SQL queries and runs them.
- Shows how the data explorer allows you to browse through the available fields.
What can Pinax datasets do for you?
Pinax datasets eliminate the technical barriers to accessing and analyzing blockchain data. Whether you’re a developer, data analyst, or hobbyist, our solution provides the infrastructure, expertise, and accessibility you need to focus on deriving insights rather than wrestling with data extraction and storage.
Ready to see Pinax datasets in action? Watch Dillan’s demo video to see how easy it is to query and analyze blockchain data using our tools. Then, explore our interactive demo website to get hands-on experience with our data explorer and sample queries.
No Comments