Layer 1 · gdr-12324805
startree.ai
Latin dominant · moderate lexical diversity · mixed register · moderate clause complexity · moderate topic focus · strong uncommon edge signal
Schema: 35% AI · LIVE Minted: 2026-05-15 Visit Source ↗ manifest.json ↗
Entity Identity gdr-12324805 · minted 2026-05-15T20:56:27Z
◈ This record is claimable  ·  Verified entities are cited by AI systems
Claim Entity Profile →
AI · Entity Record
startree.ai
JSON-LD ✓Root-LD ✗schema.org ✓
STATUS: LIVE SSL: VALID SECURITY: MINIMAL FRESHNESS: RECENT TLD EDGE: .ai ↗
◈ Topology Position
Latin dominant · moderate lexical diversity · mixed register · moderate clause complexity · moderate topic focus · strong uncommon edge signal
◈ Entity Topology Map
gdr-12324805 · v1.0.0 · Law III+V+VI
67642aff74ea33e77fa7d40c7af81846startree.aigdr-12324805WebPageReadActionBreadcrumbLiListItemWebSiteSearchActionTTR0.42HAPAX0.254SKEW0.675PARA0.571TTR0.4203HAPAX0.2536REP0.7464SCHEMA34%TOKENS3,036NODES28SCHEMA TYPESTOPOLOGYGRAPH EDGESNEG SPACE
Latin dominant · moderate lexical diversity · mixed register · moderate clause complexity · moderate topic focus · strong uncommon edge signal
Federation ID
gdr-12324805
Slug
startree-ai
TLD
.ai
Status Code
200
Response Time
3908ms
Interior Pages
3
Interior Words
1,771
Minted At
2026-05-15T20:56:27Z
Law I — Provenance · Law II — Temporal Attestation Visit startree.ai ↗
SEO Record extracted from https://www.startree.ai/
Title
StarTree: The Platform for Continuous Insights
H1
The Platform for Continuous Insights
Meta Description
A managed real-time analytics platform designed for OLAP at massive speed and scale, built on top of Apache Pinot.
Canonical URL
https://startree.ai/
Language Attribute
en-US
Word Count
1265
Open Graph Tags
og:locale: en_USog:type: websiteog:title: Homeog:description: A managed real-time analytics platform designed for OLAP at massive speed and scog:url: https://startree.aiog:site_name: StarTree
Twitter / X Tags
twitter:card: summary_large_imagetwitter:title: Hometwitter:site: @startreedata
H2 (1)
Proven with the most demanding use cases
H2 (2)
For low-latency, high-concurrency analytics
H2 (3)
Fully managed, enterprise-grade
H2 (4)
DBaaS, BYOC or BYOK?
Full Extracted Text Corpus 20,920 chars · 3,036 words · 3 pages · Law I
Everything startree.ai said about itself — extracted verbatim from 3 pages, 3,036 words total. No editorial layer. No inference. Law III — the text is the measurement. Meaning is the reader's. Minted: 2026-05-15T20:56:27Z
◈ Homepage — https://www.startree.ai/Skip to main content Skip to footer May 20 - Webinar - Iceberg Performance Benchmark: StarTree vs. Trino vs. ClickHouse : RSVP Here Product Solutions Token-Level Observability for LLMs: How Together AI Does It BurdaForward: Analyzing Quality News Performance Using Real-Time Analytics Webex Meets Unprecedented Analytics Demand with Apache Pinot Dialpad Powers Real-Time Customer Intelligence with StarTree Cloud Taming the Kafka Firehose: How CrowdStrike is Scaling Real-Time Security Analytics with Pinot Token-Level Observability for LLMs: How Together AI Does It BurdaForward: Analyzing Quality News Performance Using Real-Time Analytics Webex Meets Unprecedented Analytics Demand with Apache Pinot Dialpad Powers Real-Time Customer Intelligence with StarTree Cloud Docs Resources Pricing Login Talk to a Pinot expert The Platform for Continuous Insights StarTree, powered by Apache Pinot, turns raw data streams—like clickstreams, logs, vector embeddings, and sensor data—into real-time analytics. It powers customer-facing data products, agent-facing data products, and operational intelligence. WATCH THE RECORDING Unlocking Revenue-Critical workloads on Iceberg Watch our recent webinar on how to leverage a purpose-built execution layer to achieve deterministic SLAs directly on Iceberg tables. Eliminate shadow stacks and restore architectural coherence for revenue-critical insights. Watch Now Proven with the most demanding use cases How Grab Built a Next-Gen Observability Stack for Its Rideshare Marketplace One of Southeast Asia’s largest makers of ‘super apps’ – is powering real-time observability in its rideshare marketplace with Apache Pinot. Its integrated platform supports operations across more than 300 cities with one-second response times. In this talk from Real-time … Learn more 10 Million Requests per month 1 second Query Latency (p95) 90% Growth Over six months Webex Meets Unprecedented Analytics Demand with Apache Pinot Seeking a real-time analytics solution to meet spiking customer demand for insights in the pandemic era, Webex chose Apache Pinot over Clickhouse and Elasticsearch. Learn more 100 tb data daily 100+ production nodes 4x faster queries Token-Level Observability for LLMs: How Together AI Does It As large language models (LLMs) become core infrastructure, a new engineering challenge is taking shape: capturing how these models are used—and how they behave—as it happens. It’s no longer sufficient to count API calls or log failures after the fact. … Learn more Sub-second Query Latency High-cardinality slicing and dicing 10 second freshness windows How Grab Built a Next-Gen Observability Stack for Its Rideshare Marketplace One of Southeast Asia’s largest makers of ‘super apps’ – is powering real-time observability in its rideshare marketplace with Apache Pinot. Its integrated platform supports operations across more than 300 cities with one-second response times. In this talk from Real-time … Learn more 10 Million Requests per month 1 second Query Latency (p95) 90% Growth Over six months Webex Meets Unprecedented Analytics Demand with Apache Pinot Seeking a real-time analytics solution to meet spiking customer demand for insights in the pandemic era, Webex chose Apache Pinot over Clickhouse and Elasticsearch. Learn more 100 tb data daily 100+ production nodes 4x faster queries Token-Level Observability for LLMs: How Together AI Does It As large language models (LLMs) become core infrastructure, a new engineering challenge is taking shape: capturing how these models are used—and how they behave—as it happens. It’s no longer sufficient to count API calls or log failures after the fact. … Learn more Sub-second Query Latency High-cardinality slicing and dicing 10 second freshness windows How Grab Built a Next-Gen Observability Stack for Its Rideshare Marketplace One of Southeast Asia’s largest makers of ‘super apps’ – is powering real-time observability in its rideshare marketplace with Apache Pinot. Its integrated platform supports operations across more than 300 cities with one-second response times. In this talk from Real-time … Learn more 10 Million Requests per month 1 second Query Latency (p95) 90% Growth Over six months See more case studies APACHE PINOT For low-latency, high-concurrency analytics FAST Sub-second analytic queries Blazing fast query speeds through innovative indexing techniques that send near instant responses — even at high volume &emdash; with a wide variety of data types including JSON, vector, timestamps and geospatial. More FRESH Insights on live data Continuous ingestion keeps data flowing. Indexes are updated as data lands – in real-time. Pinot is proven to ingest 1M events/s, while also maintaining high throughput for queries. More HIGH CONCURRENCY Extreme queries per second Support rich user-facing analytic applications. Pinot is capable of serving hundreds of thousands of simultaneous queries across huge and changing datasets without performance bottlenecks. More AFFORDABLE Low compute and storage costs Pinot has been engineered to scan and move less data, so it’s faster AND cheaper. Innovative optimizations such as data partitioning strategies, segment assignment strategies, and smart query routing reduce compute and storage costs. More Apache Pinot is a rapidly growing and thriving open-source project. More STARTREE CLOUD Fully managed, enterprise-grade StarTree extends Apache Pinot into a managed data platform with enterprise-grade features so you can focus on insights, not infrastructure. StarTree adds security, scalability, and many additional capabilities: Interactive queries on the data lake Precision fetch allows for interactive query on data in object storage — directly, and without large data transfers. More Easier Administration Eliminate unnecessary infrastructure costs during idle times with autoscaling; also, ease of cluster, instance, and storage management.  More Scalable Upserts Off-heap upsert functionality can handle updates to billions of primary keys per server without sacrificing performance. More Enterprise Security Additional security features, including RBAC, advanced data encryption, single sign-on, and SOC 2 and ISO 27001 compliance ensure your data is safe and protected. More Improved data management StarTree's Data Portal UI adds capabilities and makes it easier to ingest and manage data. Evolve schema, tune performance and backfill data without disrupting operations. More Improved data management Connect with other enterprise data sources, such as Confluent, Snowflake and Tableau with additional StarTree integrations. More DEPLOYMENT OPTIONS DBaaS, BYOC or BYOK? StarTree Cloud offers public SaaS (Software as a Service) and private BYOC/BYOK deployment options. For more information about our public or private SaaS offering, schedule a demo with our StarTree solutions architect team. STARTREE CLOUD SaaS Get started quickly with the fully managed, StarTree Cloud service. Ideal for organizations that want zero ops overhead and rapid time-to-value. Setup is simple, leaving you to focus on your data, your schema, and queries. BRING YOUR OWN CLOUD BYOC For customers who prefer dedicated infrastructure and deeper integration with enterprise systems, a Bring Your Own Cloud (BYOC) model includes VPC peering, custom SLAs, and enhanced observability. BRING YOUR OWN KUBERNETES BYOK For highly regulated environments, StarTree can also be deployed behind your firewall—offering full control while still leveraging the power and capabilities of StarTree Cloud. Deployment Options: Learn More The benefits of migrating to StarTree and Pinot center around three areas: query performance, SLAs, and cost. With Pinot, we are now in the subseconds to milliseconds range, the higher query concurrency means we can serve more customers faster, and we’ve been able to reduce our infrastructure costs. Joanes Espanol CTO and Co-Founder GET THE GUIDE Real-time analytics at scale Leading organizations like Uber and Stripe are harnessing real-time insights to efficiently power customer-facing data products. This comprehensive report details how they do it. Download your Digital Copy Today First Name* Last Name* Company Name* Business Email* I have questions for a product expert. I agree to the StarTree Community Terms and Conditions.* PLATFORM Apache Pinot StarTree Cloud StarTree ThirdEye Deployment Options Pricing SOLUTIONS Real-time Analytics Case Studies Observability Fraud Detection Customer-Facing Analytics FOR DEVELOPERS Docs Community Support Releases RTA Summit ABOUT About Careers Partners Press & Media Contact Legal Sign up for our newsletter... © 2025 StarTree Inc. All rights reserved. Trademark Policy Privacy Policy Terms of Use Responsible Disclosure ◈ Interior Pages — 3 pages crawledStartree release notes - StarTree Docs Skip to main content StarTree Docs home page Search... ⌘ K Ask AI Go to StarTree.ai Talk to us Talk to us Search... Navigation Home StarTree Cloud ThirdEye Apache Pinot Recipes AI API Reference Community Blogs Videos Get Started StarTree Cloud What is Real-Time Analytics? Ingestion Overview Connectors Data Modeling Additional Configuration Advanced Operations Recipes StarTree Iceberg/S3 Tables Onboarding via Data Portal Observability Data Type Mapping Indexes Data and Index caching Query Data Query Languages Query Interfaces Query Functions Advanced Operations Query Analyzer StarTree MCP server Manage Data Indexes Upserts Off-Heap Dedup Cloud Tiered Storage Hybrid Tables Data Operations Recipes Visualize Data Tableau Superset Manage Security Overview Authorization API Token Management Pinot Audit Logging Configure IdP Release Notes Data Portal 0.15.0 0.14.0 Older Releases Reference Glossary Cluster Health StarTree Cloud V2 Benefits On this page StarTree release notes Startree release notes Documentation Index Fetch the complete documentation index at: https://docs.startree.ai/llms.txt Use this file to discover all available pages before exploring further. ​ StarTree release notes 0.12.0 : February 2026 0.11.1 : September 2025 0.11.0 : July 2025 0.10.1 : February 2025 0.10.0 : November 2024 0.9.0 : June 2024 0.8.1 : April/May 2024 0.8.0 : February 2024 0.7.2 : December 2023 0.7.1 : November 2023 0.7.0 : August 2023 0.6.1 : April 2023 0.6.0 : February 2023 0.5.0 : June - August 2022 0.4.0 : April - May 2022 Was this page helpful? Yes No ⌘ I StarTree Docs home page x youtube github linkedin Trademarks Privacy Terms of Use x youtube github linkedin x youtube github linkedin Powered by This documentation is built and hosted on Mintlify, a developer documentation platform Assistant Responses are generated using AI and may contain mistakes. Download Recipes - StarTree Docs Skip to main content StarTree Docs home page Search... ⌘ K Ask AI Go to StarTree.ai Talk to us Talk to us Search... Navigation Recipes Download Recipes Home StarTree Cloud ThirdEye Apache Pinot Recipes AI API Reference Community Blogs Videos Recipes Download Recipes Batch Ingestion Streaming Ingestion Transformation Functions Deep Storage Upserts Real-Time-Offline Job JSON Documents Geospatial Merge and Roll Up Clickstream Analytics Dashboard Other On this page Clone the pinot-recipes repository using SSH Clone the pinot-recipes repository using HTTPS Recipes Download Recipes Documentation Index Fetch the complete documentation index at: https://docs.startree.ai/llms.txt Use this file to discover all available pages before exploring further. Recipes help you learn how to solve common problems with Apache Pinot. To download Apache Pinot recipes, do one of the following: Clone the repository using SSH Clone the repository using HTTPS If you don’t have a Git client or don’t want to clone the repository, click to download a zip file that contains the recipes. ​ Clone the pinot-recipes repository using SSH To clone the pinot-recipes repository using SSH, do the following: Set up SSH keys associated with your GitHub account on your local machine. For more information, see Connecting to GitHub with SSH . Run the following command: git clone [email protected]:startreedata/pinot-recipes.git ​ Clone the pinot-recipes repository using HTTPS To clone the pinot-recipes repository using HTTPS, do the following: Run the following command: git clone https://github.com/startreedata/pinot-recipes.git At the prompt, enter your GitHub username and a personal access token. If you forgot your GitHub username, see Remembering your GitHub username or email . If you need a personal access token, see Creating a personal access token . Was this page helpful? Yes No CSV with Column Spaces ⌘ I StarTree Docs home page x youtube github linkedin Trademarks Privacy Terms of Use x youtube github linkedin x youtube github linkedin Powered by This documentation is built and hosted on Mintlify, a developer documentation platform Assistant Responses are generated using AI and may contain mistakes. Taming the Kafka Firehose: How CrowdStrike is Scaling Real-Time Security Analytics with Pinot | StarTree Settings --> Settings --> Skip to main content Skip to footer May 20 - Webinar - Iceberg Performance Benchmark: StarTree vs. Trino vs. ClickHouse :  RSVP Here Product What is Apache Pinot? An open-source database for real-time analytics at scale and speed. More Apache Pinot on StarTree Cloud StarTree extends Apache Pinot as a fully managed service, with enterprise features and improved performance. Overview Architecture Features Security & Compliance Releases Deployment Options Iceberg Tables StarTree Cloud now offers reliable, low-latency query on the data lake. More Solutions Learn about the foundations behind customer-facing dashboards, agent-facing analytics, and observability systems. More By Industry Finance Healthcare Gaming Retail Transportation & Delivery By Use Case Customer Facing Analytics Observability Machine Learning Fraud Detection Interactive Analytics for the Lakehouse Case Studies (See all) Webex Meets Unprecedented Analytics Demand with Apache Pinot Dialpad Powers Real-Time Customer Intelligence with StarTree Cloud Taming the Kafka Firehose: How CrowdStrike is Scaling Real-Time Security Analytics with Pinot Token-Level Observability for LLMs: How Together AI Does It BurdaForward: Analyzing Quality News Performance Using Real-Time Analytics Docs Request a Trial Start with StarTree Cloud! Our onboarding team will help you succeed! Get Started Documentation Ingestion Query Data Indexes Tiered Storage Security Releases API Reference Get Involved Community Support Apache Pinot 101 Pinot Recipes Resources Grafana has become the de facto standard for visualizing operational metrics, thanks to its rich ecosystem and intuitive PromQL-based query… Resources Blogs In the News Events & Webinars Videos Case Studies Comparisons Pinot vs Druid vs Clickhouse Pinot vs ElasticSearch Pinot vs pre-aggregated KV Stores Pinot vs Time-Series Databases Pinot vs StarTree? Pricing Login Book a Demo Login Talk to&nbspa Pinot expert CASE STUDY Taming the Kafka Firehose: How CrowdStrike is Scaling Real-Time Security Analytics with Pinot CrowdStrike leverages Apache Pinot’s real-time distributed OLAP datastore to support security analytics handling 25k QPS across 10 clusters. 120,000 events per second 25,000 queries per second 5 billion events (on one table) In this talk at RTA Summit 2025, Yusuf Kulah shares how they do it! CrowdStrike’s backend runs on dozens of microservices, many of which consume a shared firehose of events via the Kafka distributed event streaming platform, said Yusuf Külah, a CrowdStrike senior engineer, in a technical session at the Real-Time Analytics Summit hosted by StarTree. Kafka acts as a central messaging system, broadcasting a stream of data like system metrics from endpoint agents. Managing variable loads The problem facing Crowdstrike was when Kafka topics experienced highly variable loads, often spiking when new customers came on board or when malware caused a surge in system events. Microservices like front-end request handlers must respond quickly in such situations, while others take longer to digest data. To keep service-level objectives (SLOs) intact, the team built a Pinot-powered monitoring layer that tracks event volume in real-time and publishes signals to a data bus when thresholds are crossed. This “traffic cop” approach helps services implement throttling or other mitigation plans dynamically. Supporting security analytics Using the same Kafka event stream, CrowdStrike applied Pinot to a second challenge: detecting data exfiltration. Crowdstrike aggregates and deduplicates events like file transfers or outbound network activity in real-time to ensure analysts get a clean feed of what’s leaving the system and where it’s going. Security analysts can monitor a dashboard to slice and dice data by destination, source, or other dimensions. This enables them to quickly pinpoint, for example, a rogue internal user shipping data to an unapproved endpoint. Wrestling with Protobuf complexity CrowdStrike’s events are encoded in Protobuf, a compact data serialization format that lets them define structured data using schemas, including nested types and enumerations. While Protobuf is great for performance and consistency, it’s less effective when schemas are evolving, deeply nested, and not uniform across environments. A bigger problem was that Protobuf changes forced restarts of Pinot servers, since Pinot had to reload descriptors with each change. Parsing complex Protobufs was slow, especially when only part of the message was relevant. Crowdstrike’s solution was to introduce a preprocessing layer. This component takes complex Protobuf events, extracts what’s needed, and emits a “lean” JSON version aligned with Pinot’s schema. It offloads heavy parsing, improves performance, skips unneeded fields, and avoids Pinot restarts. The tradeoff is that Kafka carries a duplicate lean event stream, which adds storage cost. Crowdstrike determined that benefits outweighed the bloat, however. Kafka’s dynamic addressing challenge In a dynamic infrastructure, Kafka message broker addresses change frequently, challenging Pinot to keep up. Failure to refresh broker lists in real-time risks ingestion delays or data loss, which are unacceptable in security analytics. DNS-based discovery was an obvious solution but not an option due to internal security policies, so CrowdStrike built an automated broker updater. This keeps Pinot’s configuration in sync with Kafka’s current state, ensuring smooth ingestion even as the cluster evolves. As a bonus, the updater mechanism also helped prevent configuration drift. If a Pinot table’s settings deviated from the official configuration, the updater would automatically realign it. Turning point-in-time queries into feeds Pinot is fast, but single queries are just snapshots. CrowdStrike needed a continuous data feed for some services, especially those that make decisions based on trends, such as a spike in data egress volumes over a defined period of time. Running hundreds or thousands of point-in-time queries from microservice replicas would have been wasteful and redundant, so the CrowdStrike team built a query scheduler to convert snapshots into continuous feeds. The scheduler deduplicates queries, reduces processing load, and enables real-time context. The big picture The system comprises four layers as illustrated: Data sources are primarily Kafka, but also include some batch feeds. The Preprocessor parses Protobufs and emits lean JSON events. The Apache Pinot cluster is the core analytics engine. It ingests lean events into real-time tables and optionally into batch-aggregated offline tables for long-term queries. Data consumers are the downstream services and dashboards. Some query Pinot directly, while others rely on the continuous feed via the query scheduler. Pinot is deployed in a multi-tenant setup, with some servers dedicated per tenant depending on the workload. This design balances performance and isolation. The results CrowdStrike’s Pinot deployment has been able to handle whatever real-time data is thrown at it. A single table replica handles up to 120,000 events per second. One table has seen over 5 billion ingested events. The busiest table supports up to 25,000 queries per second. All of this runs across five global production environments and more than 10 Pinot clusters. CrowdStrike’s real-time analytics system isn’t just fast; it’s resilient, scalable, and purpose-built for high-stakes cybersecurity work. From preprocessing complex data to automating infrastructure adaptation, the team’s approach demonstrates how thoughtful engineering can turn raw event firehoses into actionable insights. Learn more StarTree provides Apache Pinot as a managed service with many extended capabilities. Book a meeting with one of our experts to learn more a
◈ Crawled Pages — Provenance Chain
https://www.startree.ai/hhttps://startree.ai/event/iceberg-query-performance-benchmark-startree-vs-trino-vs-clickhouse-webinar/https://docs.startree.ai/https://docs.startree.ai/api-reference/introductionhttps://docs.startree.ai/concepts/about-startree-cloudhttps://docs.startree.ai/concepts/what-is-real-time-analyticshttps://docs.startree.ai/corecapabilities/ai/overviewhttps://docs.startree.ai/corecapabilities/external-table/data-and-index-cachinghttps://docs.startree.ai/corecapabilities/external-table/data-type-mappinghttps://docs.startree.ai/corecapabilities/external-table/indexeshttps://docs.startree.ai/corecapabilities/external-table/observabilityhttps://docs.startree.ai/corecapabilities/external-table/onboarding-data-portalhttps://docs.startree.ai/corecapabilities/ingestdata/dataportal/additional-config-create-tablehttps://docs.startree.ai/corecapabilities/ingestdata/ingest_data_overviewhttps://docs.startree.ai/corecapabilities/manage-data/hybrid-tableshttps://docs.startree.ai/corecapabilities/manage-data/indexes/overviewhttps://docs.startree.ai/corecapabilities/manage-data/offheap-deduphttps://docs.startree.ai/corecapabilities/manage-data/set-up-tiered-storage/architecturehttps://docs.startree.ai/corecapabilities/query_data/mcp/startree-mcphttps://docs.startree.ai/corecapabilities/query_data/query_interfaces/query_consolehttps://docs.startree.ai/corecapabilities/security/audit/pinot-audit-logshttps://docs.startree.ai/corecapabilities/security/manage-api-tokenshttps://docs.startree.ai/corecapabilities/security/manage-securityhttps://docs.startree.ai/corecapabilities/visualize_data/connect-superset-to-startree-cloudhttps://docs.startree.ai/corecapabilities/visualize_data/connect-tableau-to-startree-cloudhttps://docs.startree.ai/getstarted/deployment/components_deployedhttps://docs.startree.ai/getstarted/deployment/deployment_modelshttps://docs.startree.ai/getstarted/deployment/security_certificationshttps://docs.startree.ai/getstarted/productshttps://docs.startree.ai/llms.txthttps://docs.startree.ai/recipes/csv-files-spaces-column-nameshttps://docs.startree.ai/recipes/real-time-clickstream-dashboard-guidehttps://docs.startree.ai/recipes/recipeshttps://docs.startree.ai/reference/cloud-v2-benefitshttps://docs.startree.ai/reference/glossaryhttps://docs.startree.ai/reference/startree-release-noteshttps://docs.startree.ai/reference/startree-release-notes/https://docs.startree.ai/reference/startree-release-notes/0.14.0https://docs.startree.ai/reference/startree-release-notes/0.15.0https://docs.startree.ai/reference/startree-release-notes/data-portalhttps://docs.startree.ai/reference/startree-release-notes/startree-release-notes/0.10.0.md+88 more
Law I — Provenance · Law III — Reverse Ontology · source: https://www.startree.ai/ Visit Source ↗
Root-LD — Traveling Context Pod v1.0 · gdr-12324805 · three layers
40
Graph Edges
3,036
Tokens Measured
0.4203
Type-Token Ratio
3
Schema Blocks
35%
Schema Coverage
Root-LD is the traveling context pod for this entity — permanent, provenance-grounded. The head <script> block is machine-readable. This section shows the same data to humans. We show the work in both spaces.
Layer 1 — Anchor · Immutable after mint. UUID, federation_id, content hash, timestamps. A new crawl appends to recursive — the anchor is never touched. Law I — Provenance.
rld:anchor — gdr-12324805
{
  "uuid": "12324805-0b27-405b-aeac-9b6e9ed92013",
  "federation_id": "gdr-12324805",
  "sequence": 0,
  "content_hash": "94293dadc50366f8aff4693eebff596657e934aa34d16b70dc32d0bab5bf8888",
  "primary_source": "https://www.startree.ai/",
  "source_verified": true,
  "generation_method": "crawl_extract_v1",
  "spec_version": "1.0",
  "queued_at": "2026-05-15T20:56:27.370668+00:00",
  "minted_at": "2026-05-15T20:56:27.370668+00:00"
}
Layer 2 — Body · Complete measurement snapshot frozen at mint. Identity, SEO, schema graph, six-layer topology fingerprint, ratio signals, navigation. Law II — Temporal Attestation.
rld:body — startree.ai
{
  "domain": "startree.ai",
  "canonical_url": "https://www.startree.ai/",
  "tld": "ai",
  "slug": "startree-ai",
  "status_code": 200,
  "redirect_chain": [],
  "response_time_ms": 3908,
  "ssl_valid": true,
  "server_header": "cloudflare",
  "title": "StarTree: The Platform for Continuous Insights",
  "h1": "The Platform for Continuous Insights",
  "meta_description": "A managed real-time analytics platform designed for OLAP at massive speed and scale, built on top of Apache Pinot.",
  "lang_declared": "en-US",
  "schema_types": [
    "WebPage",
    "ReadAction",
    "BreadcrumbList",
    "ListItem",
    "WebSite",
    "SearchAction",
    "EntryPoint",
    "PropertyValueSpecification",
    "Organization",
    "ImageObject"
  ],
  "schema_score": 0.3452,
  "schema_prop_count": 29,
  "schema_gap_list": [
    "significantLink",
    "mainContentOfPage",
    "reviewedBy",
    "speakable",
    "lastReviewed",
    "specialty",
    "relatedLink",
    "funding",
    "provider",
    "genre",
    "wordCount",
    "accessModeSufficient",
    "acquireLicensePage",
    "temporalCoverage",
    "thumbnail",
    "commentCount",
    "displayLocation",
    "archivedAt",
    "digitalSourceType",
    "assesses"
  ],
  "top_semantic_words": [
    "pinot",
    "data",
    "startree",
    "real",
    "analytics",
    "apache",
    "recipes",
    "query",
    "cloud",
    "crowdstrike",
    "second",
    "observability",
    "using",
    "security",
    "queries",
    "performance",
    "kafka",
    "events",
    "github",
    "docs",
    "clone",
    "customer",
    "talk",
    "built",
    "per",
    "token",
    "llms",
    "platform",
    "facing",
    "documentation",
    "repository",
    "insights",
    "infrastructure",
    "storage",
    "webex",
    "demand",
    "operations",
    "across",
    "latency",
    "ingestion"
  ],
  "ratio_signals": {
    "schema_density": 0.725,
    "nav_ratio": 0.4922,
    "content_to_structure_ratio": 0.015747,
    "external_tld_diversity": 3,
    "self_declaration_coherence": 0.4867,
    "schema_to_navigation_alignment": 0.0,
    "javascript_surface_ratio": 0.0,
    "url_depth_distribution": {
      "depth_0": 6,
      "depth_1": 23,
      "depth_2": 55,
      "depth_3plus": 44
    }
  },
  "semantic_html_ratio": 0.0,
  "javascript_surface_ratio": 0.0,
  "img_alt_coverage": 0.0,
  "robots_complexity_score": 0,
  "ariadne_blocked": false,
  "security_label": "MINIMAL",
  "https_enforced": true,
  "freshness_label": "RECENT",
  "tld_starjet_url": "https://globaldataregistry.com/registry/tld/ledger/ai",
  "schema_starjet_urls": [
    "https://globaldataregistry.com/registry/schema/ledger/webpage",
    "https://globaldataregistry.com/registry/schema/ledger/readaction",
    "https://globaldataregistry.com/registry/schema/ledger/breadcrumblist",
    "https://globaldataregistry.com/registry/schema/ledger/listitem",
    "https://globaldataregistry.com/registry/schema/ledger/website",
    "https://globaldataregistry.com/registry/schema/ledger/searchaction",
    "https://globaldataregistry.com/registry/schema/ledger/entrypoint",
    "https://globaldataregistry.com/registry/schema/ledger/propertyvaluespecification",
    "https://globaldataregistry.com/registry/schema/ledger/organization",
    "https://globaldataregistry.com/registry/schema/ledger/imageobject"
  ],
  "native_text_sample": "Skip to main content\nSkip to footer\nMay 20 - Webinar - Iceberg Performance Benchmark: StarTree vs. Trino vs. ClickHouse : RSVP Here\nProduct\nSolutions\nToken-Level Observability for LLMs: How Together AI Does It\nBurdaForward: Analyzing Quality News Performance Using Real-Time Analytics\nWebex Meets Unprecedented Analytics Demand with Apache Pinot\nDialpad Powers Real-Time Customer Intelligence with StarTree Cloud\nTaming the Kafka Firehose: How CrowdStrike is Scaling Real-Time Security Analytics with",
  "topology_fingerprint_version": "1.0.0"
}
Layer 3 — Recursive · Empty at mint. Grows forever through accumulated corpus passes. Common edges (Law V), uncommon edges (Law VI), topology cluster scores. The graph builds itself. Law VII — Torus.
rld:recursive — edge_count=0
{
  "edges": [],
  "appended_at": [],
  "edge_count": 0
}
Root-LD v1.0 · root-ld.org · Law I+II+VII root-ld.org ↗
Schema.org Intelligence scored · graph traversal · Law VI negative space
35% coverage · 10 types · 29 props · 52 gaps · click to expand
35%
Schema Utilization Score
PARTIAL COVERAGE — GAPS IDENTIFIED
schema.org v2.0.0 · 29 props extracted · 52 gaps · https://www.startree.ai/
CreativeWorkWebPageReadActionBreadcrumbListListItemWebSite
◈ Schema Graph — Three-Direction Traversal
Declared: WebPage · ReadAction · BreadcrumbList · ListItem · WebSite · SearchAction · EntryPoint · PropertyValueSpecification · Organization · ImageObject
✓ Implemented
urlownhttps://startree.ai/
nameownStarTree: The Platform for Continuous Insights
isPartOfownhttps://startree.ai/#website
aboutownhttps://startree.ai/#organization
datePublishedown2022-12-08T19:02:27+00:00
dateModifiedown2026-04-06T18:01:16+00:00
descriptionownA managed real-time analytics platform designed for OLAP at massive speed and scale, built on top of Apache Pinot.
breadcrumbownhttps://startree.ai/#breadcrumb
inLanguageownen-US
potentialActionown[ReadAction]
targetownhttps://startree.ai/
itemListElementownHome
positionown1
publisherownhttps://startree.ai/#organization
query-inputown[PropertyValueSpecification]
urlTemplateownhttps://startree.ai/?s={search_term_string}
valueRequiredownTRUE
valueNameownsearch_term_string
logoownhttps://startree.ai/#/schema/logo/image/
imageownhttps://startree.ai/#/schema/logo/image/
sameAsownhttps://x.com/startreedata (+2 more)
contentUrlownhttps://startree.ai/wp-content/uploads/2025/07/startree-logo-dark.png
widthown207
heightown36
captionownStarTree
creatorownMintlify
primaryImageOfPageownhttps://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/#primaryimage
thumbnailUrlownhttps://startree.ai/wp-content/uploads/2025/08/rt-crowdstrikel-talk.jpg
itemownhttps://startree.ai/
✗ Not Implemented / Gap
slogangap
areaServedgap
emailgap
legalNamegap
keywordsgap
aggregateRatinggap
knowsAboutgap
telephonegap
contactPointgap
openingHoursgap
hasOfferCataloggap
addressgap
priceRangegap
alternateNamegap
identifiergap
numberOfEmployeesgap
geogap
foundingDategap
significantLinkgap
mainContentOfPagegap
reviewedBygap
speakablegap
lastReviewedgap
specialtygap
relatedLinkgap
fundinggap
providergap
genregap
wordCountgap
accessModeSufficientgap
acquireLicensePagegap
temporalCoveragegap
thumbnailgap
commentCountgap
displayLocationgap
archivedAtgap
digitalSourceTypegap
assessesgap
CreativeWorkancestor +1schema.org/CreativeWork ↗9/111 (8%)
The most generic kind of creative work, including books, movies, photographs, software programs, etc.
publisherthumbnailUrldatePublishedinLanguagecreatoraboutpositionisPartOfdateModified
fundingprovidergenrewordCountaccessModeSufficientacquireLicensePagetemporalCoveragethumbnailcommentCountdisplayLocation
Thingancestor +2schema.org/Thing ↗6/13 (46%)
The most generic type of item.
sameAsnamepotentialActiondescriptionurlimage
additionalTypeidentifierownersubjectOfmainEntityOfPagealternateNamedisambiguatingDescription
SoftwareApplicationsibling via CreativeWorkschema.org/SoftwareApplication ↗23 exclusive
A software application.
fileSizecountriesSupportedfeatureListprocessorRequirementsstorageRequirementsreleaseNotessoftwareRequirementsoperatingSystem
MediaObjectsibling via CreativeWorkschema.org/MediaObject ↗18 exclusive
A media object, such as an image, video, audio, or text object embedded in a web page or a downloadable dataset i.e. DataDownload. Note that a creative work may
durationassociatedArticleheightstartTimeplayerTypesha256uploadDateineligibleRegion
VisualArtworksibling via CreativeWorkschema.org/VisualArtwork ↗13 exclusive
A work of art that is primarily visual in character.
weightheightcoloristartworkSurfaceartistartformdepthartEdition
CreativeWorkSeasonsibling via CreativeWorkschema.org/CreativeWorkSeason ↗10 exclusive
A media season, e.g. TV, radio, video game etc.
endDateseasonNumberactorepisodestartDatetrailernumberOfEpisodespartOfSeries
Coursesibling via CreativeWorkschema.org/Course ↗10 exclusive
A description of an educational course which may be offered as distinct instances which take place at different times or take place at different locations, or b
numberOfCreditssyllabusSectionseducationalCredentialAwardedhasCourseInstancetotalHistoricalEnrollmentfinancialAidEligiblecoursePrerequisitesoccupationalCredentialAwarded
MusicCompositionsibling via CreativeWorkschema.org/MusicComposition ↗10 exclusive
A musical composition.
recordedAsfirstPerformancelyricsmusicArrangementlyricistcomposeriswcCodemusicCompositionForm
TVSeriessibling via CreativeWorkschema.org/TVSeries ↗10 exclusive
CreativeWorkSeries dedicated to TV broadcast and associated online delivery.
containsSeasonmusicByactornumberOfSeasonsepisodetrailertitleEIDRnumberOfEpisodes
Reviewsibling via CreativeWorkschema.org/Review ↗9 exclusive
A review of an item - for example, of a restaurant, movie, or store.
itemReviewedassociatedClaimReviewpositiveNotesreviewAspectnegativeNotesassociatedMediaReviewreviewRatingassociatedReview
Clipsibling via CreativeWorkschema.org/Clip ↗9 exclusive
A short TV or radio program or a segment/part of a program.
endOffsetmusicBypartOfSeasonclipNumberactorpartOfSeriespartOfEpisodestartOffset
Episodesibling via CreativeWorkschema.org/Episode ↗9 exclusive
A media episode (e.g. TV, radio, video game) which can be part of a series or season.
durationmusicBypartOfSeasonactortrailerepisodeNumberpartOfSeriesproductionCompany
Messagesibling via CreativeWorkschema.org/Message ↗9 exclusive
A single message from a sender to one or more organizations or people.
toRecipientccRecipientdateReceivedrecipientdateReaddateSentbccRecipientmessageAttachment
HowTosibling via CreativeWorkschema.org/HowTo ↗8 exclusive
Instructions that explain how to achieve a result by performing a sequence of steps.
prepTimetoolstepyieldsupplyestimatedCosttotalTimeperformTime
Moviesibling via CreativeWorkschema.org/Movie ↗8 exclusive
A movie.
durationmusicByactortrailertitleEIDRsubtitleLanguageproductionCompanydirector
ExercisePlansibling via CreativeWorkschema.org/ExercisePlan ↗8 exclusive
Fitness-related activity designed for a specific health-related purpose, including defined exercise routines as well as activity prescribed by a clinician.
exerciseTypeworkloadintensityrepetitionsactivityFrequencyrestPeriodsadditionalVariableactivityDuration
HowToDirectionsibling via CreativeWorkschema.org/HowToDirection ↗8 exclusive
A direction indicating a single action to do in the instructions for how to achieve a result.
duringMediaprepTimetoolsupplybeforeMediatotalTimeperformTimeafterMedia
RealEstateListingchild / upgradeschema.org/RealEstateListing ↗+2 props
A [[RealEstateListing]] is a listing that describes one or more real-estate [[Offer]]s (whose [[businessFunction]] is typically to lease out, or to sell). The
datePostedleaseLength
MedicalWebPagechild / upgradeschema.org/MedicalWebPage ↗+1 props
A web page that provides medical information.
medicalAudience
QAPagechild / upgradeschema.org/QAPage ↗+0 props
A QAPage is a WebPage focussed on a specific Question and its Answer(s), e.g. in a question answering site or documenting Frequently Asked Questions (FAQs).
ContactPagechild / upgradeschema.org/ContactPage ↗+0 props
Web page type: Contact page.
AboutPagechild / upgradeschema.org/AboutPage ↗+0 props
Web page type: About page.
ProfilePagechild / upgradeschema.org/ProfilePage ↗+0 props
Web page type: Profile page.
CollectionPagechild / upgradeschema.org/CollectionPage ↗+0 props
Web page type: Collection page.
ItemPagechild / upgradeschema.org/ItemPage ↗+0 props
A page devoted to a single item, such as a particular product or hotel.
CheckoutPagechild / upgradeschema.org/CheckoutPage ↗+0 props
Web page type: Checkout page.
SearchResultsPagechild / upgradeschema.org/SearchResultsPage ↗+0 props
Web page type: Search results page.
FAQPagechild / upgradeschema.org/FAQPage ↗+0 props
A [[FAQPage]] is a [[WebPage]] presenting one or more "[Frequently asked questions](https://en.wikipedia.org/wiki/FAQ)" (see also [[QAPage]]).
◈ Structural Negative Type Space — Constitutional Law VI
◈ Action Branch

No structural connection to the Action branch. Graph position measurement. schema.org/Action ↗ · Law III — meaning is yours.

◈ BioChemEntity Branch

No structural connection to the BioChemEntity branch. Graph position measurement. schema.org/BioChemEntity ↗ · Law III — meaning is yours.

◈ Event Branch

No structural connection to the Event branch. Graph position measurement. schema.org/Event ↗ · Law III — meaning is yours.

◈ Intangible Branch

No structural connection to the Intangible branch. Graph position measurement. schema.org/Intangible ↗ · Law III — meaning is yours.

◈ MedicalEntity Branch

No structural connection to the MedicalEntity branch. Graph position measurement. schema.org/MedicalEntity ↗ · Law III — meaning is yours.

◈ Organization Branch

No structural connection to the Organization branch. Graph position measurement. schema.org/Organization ↗ · Law III — meaning is yours.

◈ Person Branch

No structural connection to the Person branch. Graph position measurement. schema.org/Person ↗ · Law III — meaning is yours.

◈ Place Branch

No structural connection to the Place branch. Graph position measurement. schema.org/Place ↗ · Law III — meaning is yours.

◈ Product Branch

No structural connection to the Product branch. Graph position measurement. schema.org/Product ↗ · Law III — meaning is yours.

◈ Taxon Branch

No structural connection to the Taxon branch. Graph position measurement. schema.org/Taxon ↗ · Law III — meaning is yours.

◈ Gap List (52 properties unmapped)
significantLinkmainContentOfPagereviewedByspeakablelastReviewedspecialtyrelatedLinkfundingprovidergenrewordCountaccessModeSufficientacquireLicensePagetemporalCoveragethumbnailcommentCountdisplayLocationarchivedAtdigitalSourceTypeassesseslicensekeywordshasPartfunderaccessModeaggregateRatingmaterialaccessibilityControlrecordedAtmaintainer
+22 more gaps not shown
◈ Source Schema.org — Raw Extraction (3 blocks)
Block 1 · @type: unknown
{
  "@context": "https://schema.org",
  "@graph": [
    {
      "@type": "WebPage",
      "@id": "https://startree.ai/",
      "url": "https://startree.ai/",
      "name": "StarTree: The Platform for Continuous Insights",
      "isPartOf": {
        "@id": "https://startree.ai/#website"
      },
      "about": {
        "@id": "https://startree.ai/#organization"
      },
      "datePublished": "2022-12-08T19:02:27+00:00",
      "dateModified": "2026-04-06T18:01:16+00:00",
      "description": "A managed real-time analytics platform designed for OLAP at massive speed and scale, built on top of Apache Pinot.",
      "breadcrumb": {
        "@id": "https://startree.ai/#breadcrumb"
      },
      "inLanguage": "en-US",
      "potentialAction": [
        {
          "@type": "ReadAction",
          "target": [
            "https://startree.ai/"
          ]
        }
      ]
    },
    {
      "@type": "BreadcrumbList",
      "@id": "https://startree.ai/#breadcrumb",
      "itemListElement": [
        {
          "@type": "ListItem",
          "position": 1,
          "name": "Home"
        }
      ]
    },
    {
      "@type": "WebSite",
      "@id": "https://startree.ai/#website",
      "url": "https://startree.ai/",
      "name": "StarTree",
      "description": "",
      "publisher": {
        "@id": "https://startree.ai/#organization"
      },
      "potentialAction": [
        {
          "@type": "SearchAction",
          "target": {
            "@type": "EntryPoint",
            "urlTemplate": "https://startree.ai/?s={search_term_string}"
          },
          "query-input": {
            "@type": "PropertyValueSpecification",
            "valueRequired": true,
            "valueName": "search_term_string"
          }
        }
      ],
      "inLanguage": "en-US"
    },
    {
      "@type": "Organization",
      "@id": "https://startree.ai/#organization",
      "name": "StarTree",
      "url": "https://startree.ai/",
      "logo": {
        "@type": "ImageObject",
        "inLanguage": "en-US",
        "@id": "https://startree.ai/#/schema/logo/image/",
        "url": "https://startree.ai/wp-content/uploads/2025/07/startree-logo-dark.png",
        "contentUrl": "https://startree.ai/wp-content/uploads/2025/07/startree-logo-dark.png",
        "width": 207,
        "height": 36,
        "caption": "StarTree"
      },
      "image": {
        "@id": "https://startree.ai/#/schema/logo/image/"
      },
      "sameAs": [
        "https://x.com/startreedata",
        "https://www.linkedin.com/company/startreedata/",
        "https://www.youtube.com/startreedata"
      ]
    }
  ]
}
◈ Source: https://www.startree.ai/ · Law I — Provenance
Block 2 · @type: WebSite
{
  "@context": "https://schema.org",
  "@type": "WebSite",
  "name": "StarTree Docs",
  "creator": {
    "@type": "Organization",
    "name": "Mintlify",
    "url": "https://mintlify.com"
  }
}
◈ Source: https://docs.startree.ai/reference/startree-release-notes/ · Fetched: 2026-05-15T20:56:33Z · Law I — Provenance
Block 3 · @type: unknown
{
  "@context": "https://schema.org",
  "@graph": [
    {
      "@type": "WebPage",
      "@id": "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/",
      "url": "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/",
      "name": "Taming the Kafka Firehose: How CrowdStrike is Scaling Real-Time Security Analytics with Pinot | StarTree",
      "isPartOf": {
        "@id": "https://startree.ai/#website"
      },
      "primaryImageOfPage": {
        "@id": "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/#primaryimage"
      },
      "image": {
        "@id": "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/#primaryimage"
      },
      "thumbnailUrl": "https://startree.ai/wp-content/uploads/2025/08/rt-crowdstrikel-talk.jpg",
      "datePublished": "2026-01-15T17:52:43+00:00",
      "dateModified": "2026-01-16T06:23:49+00:00",
      "breadcrumb": {
        "@id": "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/#breadcrumb"
      },
      "inLanguage": "en-US",
      "potentialAction": [
        {
          "@type": "ReadAction",
          "target": [
            "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/"
          ]
        }
      ]
    },
    {
      "@type": "ImageObject",
      "inLanguage": "en-US",
      "@id": "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/#primaryimage",
      "url": "https://startree.ai/wp-content/uploads/2025/08/rt-crowdstrikel-talk.jpg",
      "contentUrl": "https://startree.ai/wp-content/uploads/2025/08/rt-crowdstrikel-talk.jpg",
      "width": 1280,
      "height": 720
    },
    {
      "@type": "BreadcrumbList",
      "@id": "https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/#breadcrumb",
      "itemListElement": [
        {
          "@type": "ListItem",
          "position": 1,
          "name": "Home",
          "item": "https://startree.ai/"
        },
        {
          "@type": "ListItem",
          "position": 2,
          "name": "Taming the Kafka Firehose: How CrowdStrike is Scaling Real-Time Security Analytics with Pinot"
        }
      ]
    },
    {
      "@type": "WebSite",
      "@id": "https://startree.ai/#website",
      "url": "https://startree.ai/",
      "name": "StarTree",
      "description": "",
      "publisher": {
        "@id": "https://startree.ai/#organization"
      },
      "potentialAction": [
        {
          "@type": "SearchAction",
          "target": {
            "@type": "EntryPoint",
            "urlTemplate": "https://startree.ai/?s={search_term_string}"
          },
          "query-input": {
            "@type": "PropertyValueSpecification",
            "valueRequired": true,
            "valueName": "search_term_string"
          }
        }
      ],
      "inLanguage": "en-US"
    },
    {
      "@type": "Organization",
      "@id": "https://startree.ai/#organization",
      "name": "StarTree",
      "url": "https://startree.ai/",
      "logo": {
        "@type": "ImageObject",
        "inLanguage": "en-US",
        "@id": "https://startree.ai/#/schema/logo/image/",
        "url": "https://startree.ai/wp-content/uploads/2025/07/startree-logo-dark.png",
        "contentUrl": "https://startree.ai/wp-content/uploads/2025/07/startree-logo-dark.png",
        "width": 207,
        "height": 36,
        "caption": "StarTree"
      },
      "image": {
        "@id": "https://startree.ai/#/schema/logo/image/"
      },
      "sameAs": [
        "https://x.com/startreedata",
        "https://www.linkedin.com/company/startreedata/",
        "https://www.youtube.com/startreedata"
      ]
    }
  ]
}
◈ Source: https://startree.ai/user-stories/crowdstrike-scaling-realtime-security-analytics-with-pinot/ · Fetched: 2026-05-15T20:56:33Z · Law I — Provenance
schema.org v2.0.0 · source: https://www.startree.ai/ schema.org/WebPage ↗
Semantic Words 40 words · frequency ranked · Law III
40 words · top 5: pinot · data · startree · real · analytics · click to expand
Top 40 words by frequency from https://www.startree.ai/ + 3 interior pages (1,771 words total). Stop-words stripped. Ranked by repetition.
#1pinot66x · 3.29%
#2data52x · 2.59%
#3startree50x · 2.49%
#4real35x · 1.74%
#5analytics31x · 1.55%
#6apache26x · 1.3%
#7recipes23x · 1.15%
#8query21x · 1.05%
#9cloud19x · 0.95%
#10crowdstrike18x · 0.9%
#11second17x · 0.85%
#12observability16x · 0.8%
#13using15x · 0.75%
#14security15x · 0.75%
#15queries15x · 0.75%
#16performance13x · 0.65%
#17kafka13x · 0.65%
#18events13x · 0.65%
#19github13x · 0.65%
#20docs11x · 0.55%
#21clone11x · 0.55%
#22customer10x · 0.5%
#23talk10x · 0.5%
#24built10x · 0.5%
#25per10x · 0.5%
#26token9x · 0.45%
#27llms9x · 0.45%
#28platform9x · 0.45%
#29facing9x · 0.45%
#30documentation9x · 0.45%
#31repository9x · 0.45%
#32insights8x · 0.4%
#33infrastructure8x · 0.4%
#34storage8x · 0.4%
#35webex7x · 0.35%
#36demand7x · 0.35%
#37operations7x · 0.35%
#38across7x · 0.35%
#39latency7x · 0.35%
#40ingestion7x · 0.35%
Law III — frequency measured, meaning is the reader's · source: https://www.startree.ai/
Text Topology Fingerprint v1.0.0 · long · 20,921 chars · Law III
Six-layer pre-linguistic shape measurement. Deterministic. Same input, same output, always. Hash: 67642aff74ea33e77fa7d40c7af81846...
◈ Signal Matrix
0.420
TTR
0.254
HAPAX
0.746
REP
0.382
BIGRAM
0.603
H2T
0.298
CPRT
3.373
SKEW
16.299
KURT
0.655
C/P
1.650
PENT
0.571
S1P
0.003
NASC
TTR=type-token ratio · HAPAX=hapax ratio · REP=repetition score · BIGRAM=bigram repetition · H2T=hapax-to-type · CPRT=capital token ratio · SKEW=sentence skewness · KURT=sentence kurtosis · C/P=comma-period ratio · PENT=punct entropy · S1P=single-sent para ratio · NASC=non-ASCII ratio
◈ Topology Position
Latin dominant · moderate lexical diversity · mixed register · moderate clause complexity · moderate topic focus · strong uncommon edge signal
◈ Six Measurement Layers
Layer 1 — Character
0.0034
Non-ASCII Ratio
0.0 = Latin-dominant · 1.0 = fully non-Latin script
Layer 1 — Character
3.3028
Character Entropy
Shannon entropy of character distribution.
Layer 1 — Character
'e' (2069x)
Most Frequent
Highest-frequency character. Law V — common edge.
Layer 2 — Token
0.4203
Type-Token Ratio
Unique tokens / total tokens. Lexical diversity signal.
Layer 2 — Token
0.2536
Hapax Ratio
Tokens appearing exactly once. Law VI — uncommon edge.
Layer 6 — Document
0.6034
Hapax to Type
Hapax count / unique token count.
Layer 3 — Punctuation
0.6552
Comma/Period Ratio
Clause complexity per sentence.
Layer 3 — Punctuation
1.6497
Punct Entropy
Shannon entropy across punctuation types.
Layer 4 — Sentence
119
Sentence Count
Total detected sentences across all crawled pages.
Layer 4 — Sentence
3.3733
Skewness
Positive = long-tail. Negative = conversational.
Layer 5 — Paragraph
0.5714
Single Sent Ratio
High = web copy. Low = academic prose.
Layer 6 — Document
0.7464
Repetition Score
Tokens appearing more than once / total.
◈ Token Length Distribution
1-3
26%
4-6
36%
7-10
30%
11-15
7%
16-20
0%
21+
0%
◈ Density Gradient — TTR per Document Tenth
Front-loaded = abstract/preamble · Flat = consistent prose · Back-loaded = building complexity
◈ Lexical Richness Curve — Rolling Window TTR
0.51.0
Window=50 tokens · Step=25 · 120 data points
topology_fingerprint.py v1.0.0 · sha256: 67642aff74ea33e7... · Law III + Law VI
Ratio Signals 8 deterministic measurements · the gap is the signal
Eight deterministic measurements. Law I: every value traces to its source stage.
schema density
0.7250
Schema props extracted / top semantic words.
nav ratio
0.4922
Nav URLs / total internal URLs.
content to structure ratio
0.0157
Total words / raw HTML bytes. Content density.
external tld diversity
3
Unique TLD count in outbound links.
self declaration coherence
0.4867
Fuzzy overlap across title / H1 / meta / schema name.
schema to nav alignment
0.0000
Schema type tokens vs nav link text overlap.
javascript surface ratio
0.0000
Fraction of interior pages JS-gated.
URL Depth Distribution
depth_0: 6 · depth_1: 23 · depth_2: 55 · depth_3plus: 44
Internal URLs by path depth. Depth 0 = root.
Tech Stack · Security · Freshness SecurityLabel.MINIMAL · FreshnessLabel.RECENT
Sitemap: ✗Robots.txt: ✗Schema.org: ✓Open Graph: ✓Canonical: ✓HTTPS: ✓HSTS: ✗CSP: ✗
Security
SecurityLabel.MINIMAL
Freshness
FreshnessLabel.RECENT
Server
cloudflare
cmsWordPress
cdnCloudflare
web_servercloudflare
analytics['Google Analytics', 'Google Tag Manager']
Ledger Appends 11 ledgers · graph edge traversal · Law V+VII
Build: national-transit-v1.0.0 Spec: Root-LD v1.0 Status: LIVE Minted: 2026-05-15
startree.ai · gdr-12324805
startree.ai is recorded in the Global Data Registry — open provenance infrastructure for the machine-readable web.
View the Registry →