Downloads & Free Reading Options - Results

DTIC ADA366202: Density Biased Sampling: An Improved Method For Data Mining And Clustering by Defense Technical Information Center

Read "DTIC ADA366202: Density Biased Sampling: An Improved Method For Data Mining And Clustering" by Defense Technical Information Center through these free online access and download options.

Books Results

Source: The Internet Archive

The Internet Archive Search Results

Books available for download and borrowing from the Internet Archive

1. DTIC ADA366202: Density Biased Sampling: An Improved Method For Data Mining And Clustering

By Defense Technical Information Center

Data mining in large data sets often requires a sampling or summarization step to form an in-core representation of the data that can be processed more efficiently. Uniform random sampling is frequently used in practice and also frequently criticized because it will miss small clusters. Many natural phenomena are known to follow Zipf's distribution, so the inability of uniform sampling to find small clusters is of practical concern. Density Biased Sampling is proposed to probabilistically under-sample dense regions and over-sample light regions. A weighted sample is used to preserve the densities of the original data. Density biased sampling naturally includes uniform sampling as a special case. A memory-efficient algorithm is proposed that approximates density biased sampling using only a single scan of the data. We empirically evaluate density biased sampling using synthetic data sets that exhibit varying cluster size distributions. Our proposed method scales linearly and outperforms uniform sampling when clustering realistic data sets.
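The idea in the abstract can be illustrated with a minimal sketch. It is not the report's algorithm: it assumes a fixed grid as the density estimate, an inclusion probability proportional to n^(-e) for a point in a cell holding n points, and two passes over in-memory data rather than the single-scan approximation the abstract mentions. The names `density_biased_sample`, `exponent`, and `cell_width` are hypothetical. Each sampled point carries the weight 1 / probability so that weighted counts still reflect the original densities, and `exponent=0` reduces to uniform sampling.

```python
import random
from collections import Counter


def density_biased_sample(points, target_size, exponent=0.5, cell_width=1.0, seed=0):
    """Return a list of (point, weight) pairs.

    Illustrative sketch only: grid cells stand in for "regions", and the
    inclusion probability alpha / n**exponent is an assumed form.
    """
    rng = random.Random(seed)

    def cell_of(point):
        # Map a point to the grid cell that contains it.
        return tuple(int(coord // cell_width) for coord in point)

    # Pass 1: count points per cell as a crude density estimate.
    counts = Counter(cell_of(p) for p in points)

    # Pick alpha so the expected sample size is target_size:
    # sum over cells of n * (alpha / n**exponent) = target_size.
    alpha = target_size / sum(n ** (1.0 - exponent) for n in counts.values())

    # Pass 2: keep each point with a probability that shrinks as its
    # cell gets denser, and record the inverse probability as its weight.
    sample = []
    for p in points:
        n = counts[cell_of(p)]
        prob = min(1.0, alpha / n ** exponent)  # capped, so the sample may fall short
        if rng.random() < prob:
            sample.append((p, 1.0 / prob))
    return sample
```

With `exponent=1` every occupied cell contributes the same expected number of sampled points regardless of its size, which is the extreme case of over-sampling light regions; intermediate values trade off between that and uniform sampling.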

“DTIC ADA366202: Density Biased Sampling: An Improved Method For Data Mining And Clustering” Metadata:

  • Title: DTIC ADA366202: Density Biased Sampling: An Improved Method For Data Mining And Clustering
  • Author: Defense Technical Information Center
  • Language: English

Downloads Information:

The book is available for download in "texts" format. The files total 28.92 MB, have been downloaded 80 times, and went public on Tue Apr 24 2018.

Available formats:
Abbyy GZ - Additional Text PDF - Archive BitTorrent - DjVuTXT - Djvu XML - Image Container PDF - JPEG Thumb - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - chOCR - hOCR
