Llovizna: A Dynamic Data Warehousing Platform for Creating and Accessing Biomedical Data Lakes.

Saturday, September 10, 2016

A Dynamic Data Warehousing Platform for Creating and Accessing Biomedical Data Lakes.

This week we had our paper titled "A Dynamic Data Warehousing Platform for Creating and Accessing Biomedical Data Lakes." presented in Second International Workshop on Data Management and Analytics for Medicine and Healthcare (DMAH'16), co-located with 42 nd International Conference on Very Large Data Bases (VLDB 2016). Sep. 2016.

Abstract: Medical research use cases are population centric, unlike the clinical use cases which are patient or individual centric. Hence the research use cases require accessing medical archives and data source repositories of heterogeneous nature. Traditionally, in order to query data from these data sources, users manually access and download parts or whole of the data sources. The existing solutions tend to focus on a specific data format or storage, which prevents using them for a more generic research scenario with heterogeneous data sources where the user may not have the knowledge of the schema of the data a priori.

In this paper, we propose and discuss the design, implementation, and evaluation of Data Café, a scalable distributed architecture that aims to address the shortcomings in the existing approaches. Data Café lets the resource providers create biomedical data lakes from various data sources, and lets the research data users consume the data lakes efficiently and quickly without having a priori knowledge of the data schema.

No comments:

Post a Comment

You are welcome to provide your opinions in the comments. Spam comments and comments with random links will be deleted.

Subscribe to: Post Comments (Atom)

Frequent Labels

AbiCollab (1)

AbiWord (47)

Academia (1)

Accessibility (1)

ACRO (3)

ad blockers (1)

administration (1)

adsense (2)

ADSL (1)

aiCache (1)

Airavata (1)

Alaska (11)

Amazon (9)

Anchorage (2)

Andorra (1)

Anjuta (1)

Annual Post (14)

ANT (2)

Antwerp (1)

AOT (1)

Apache Bench (1)

Apache2 (3)

API Umbrella (1)

APIC (1)

Arabic (1)

Architecture (1)

Arctic (4)

Argumentum (1)

arts (1)

ASF (4)

Association Rule (2)

ASUS (1)

Atlanta (70)

Austria (3)

autoscaling (2)

Avidemux (1)

AWS (8)

Axis (1)

Axis2 (1)

Azure (1)

Back Orifice (1)

Bahamas (2)

Bananascript (1)

Barcelona (2)

Belgium (21)

bibtex (1)

BiDi (1)

Big Services (2)

BigData (4)

Bindaas (4)

Birkman (1)

bit.ly (1)

Bitmap Index (1)

Blogs (17)

BMI-CCI (1)

books (2)

boot (3)

Boston (1)

BPEL (1)

BPEL Editor (1)

Browsers (1)

Bugs (5)

Business Processes as a Service (1)

C++ (1)

Cache (1)

caMicroscope (1)

CAN (1)

Cantopop (1)

Cassandra (2)

Cassowary (2)

CAT (1)

Catroid (1)

Cbench (1)

CEAP (2)

CentOS (1)

ChanServ (1)

charity (1)

Charlotte (2)

ChatGPT (2)

chatroulette (1)

Cheese (1)

China (8)

Chinese (6)

Cloud Computing (13)

Cloud2Sim (6)

CloudSim (4)

ClustrMaps (2)

CMP (2)

Colombo (1)

Colorado (2)

commits (3)

communication (6)

Community Clouds (1)

Community Networks (1)

Computer Security (4)

Conferences (5)

Cooking (3)

CoopIS (5)

Copyrights (1)

couchsurfing (1)

COVID-19 (55)

Crawler4j (1)

Croatia (9)

Croatian (1)

Crossbuilding (6)

CSE (12)

CWL (1)

CXF (5)

Czechia (1)

DaCapo (1)

Data as a Service (1)

Data Cafe (1)

Data Cleaning (3)

Data Mining (5)

Data Quality (9)

DataStax (2)

DCM4CHE (2)

DCM4CHEE (1)

Debit Card (1)

Decade Post (1)

Deck (1)

delicious (1)

Denmark (1)

Dia (1)

Diaspora (1)

Dicom (8)

Digital Marketing Strategy (7)

DisCoTec (1)

dissertations (1)

DMAH (2)

DMKM (2)

Docker (3)

docstoc (1)

Documentation (6)

Drill (6)

Dropbox (5)

Dubai (1)

duckduckgo (1)

Duolingo (3)

DZone (2)

EC2 (3)

Eclipse (1)

Education (1)

Elance (1)

ELB (4)

EMA (4)

email (3)

EMDC (115)

Emirates (1)

EMJD-DC (135)

Emory BMI (16)

EMS (1)

Emulations (1)

EncFS (1)

Encryption (1)

Erasmus Mundus (38)

eScience (2)

eScience Workflows (1)

Estonia (1)

Ethereal (1)

EuroPar (1)

Event Notifier (2)

Evora (2)

Exchange Students (2)

Facebook (28)

Fairbanks (2)

Faro (1)

feed (2)

Felix (1)

ffmpeg (1)

FIMI (1)

Finance (2)

Finland (3)

firefox (1)

flights (2)

Floodlight (1)

FOSS (23)

France (3)

freelancer.com (3)

Freenet (1)

Freenode (1)

FSF (1)

Future Posts (42)

Garbage Collection (2)

GCI (6)

GCspy (1)

gdb (1)

gedit (1)

Germany (5)

Gerrit (2)

GHOP (1)

git (4)

GitHub (2)

Globus (1)

gmail (1)

gnome (2)

gnuplot (1)

Google (6)

Google Docs (1)

Google Maps (1)

Google Translate (1)

Google+ (4)

Governance as a Service (1)

Graffiti (1)

Gravitee (1)

Greece (1)

grep (1)

Grid Computing (1)

grive (1)

GSD (2)

GSoC (54)

GSoC Proposals (4)

gsoc2009unicode (18)

gsoc2010OgsaDaiPres (18)

gsoc2011 (4)

GSoC2012 (6)

GSoC2014 (11)

GSoC2015 (5)

GSoC2016 (2)

GSoC2019 (5)

GSoC2020 (3)

GT (1)

Gtk (1)

Hadoop (4)

Haiku (5)

Halloween (2)

Hazelcast (3)

Hinduism (1)

Hive (5)

Hoax (3)

Houston (1)

HR (2)

HTML (1)

HTTPD (2)

https (1)

Hungary (1)

IaaS (1)

IC2E (4)

ICWS (2)

IDE (1)

IFIP (2)

IJCISIM (1)

Image Processing (1)

In-Memory Data Grids (1)

India (5)

Indic Text (4)

Indonesia (2)

INESC-ID (6)

Infinispan (6)

Information Retrieval (1)

Instagram (1)

Integration (1)

Integrity Aristotle (1)

IntelliJ IDEA (3)

Internet (16)

Internet Security (6)

Internship (4)

IoT (1)

IRC (6)

IST (22)

Istanbul (1)

IT (2)

Italy (1)

Jaunty Jackalope (2)

Java PaaS (4)

javascript (2)

JAX-RS (2)

JAXB (1)

Jburg (1)

jersey (1)

Jikes RVM (9)

JMeter (2)

Karlstad (1)

Karmic Koala (3)

KAUST (3)

KDD (1)

Keycloak (2)

Keyman (1)

Kheops (5)

Kong (4)

KTH (1)

Kubernetes (1)

Language (1)

Latex (7)

Latvia (1)

Law (1)

LDAP (1)

LEAD (3)

libcanberra (1)

libccss (1)

libtool (2)

libxml2 (1)

Liechtenstein (1)

LinkedIn (12)

Linux (4)

Lisboa_memories (5)

Lisbon (109)

Lithuania (2)

Llovizna (80)

LMS (1)

Load Balancing (3)

Localization (8)

log4j (1)

London (3)

Lucid Lynx (1)

Luxembourg (1)

Lynx (1)

Mac (1)

Mac4Lin (2)

machine learning (1)

Malayalam (1)

Malaysia (1)

Maldives (1)

MapReduce (1)

markdown (1)

marketing (4)

MASCOTS (1)

Mashup (1)

Maven (1)

MEDIator (1)

MediCurator (1)

Mentor Summit (4)

Mentoring (1)

Mercurial (1)

Messaging4Transport (1)

Metacity (1)

Meteorology (1)

Mexico (2)

micronations (1)

Microsoft (1)

Middleware (1)

Mininet (7)

MMTk (5)

Mobile Networks (2)

mod proxy (2)

Modem (1)

Monaco (1)

MoNeTec (1)

MongoDB (2)

Mooshabaya (28)

Movies (1)

MSVC (1)

Music (5)

mysql (1)

NeoOffice (1)

Netherlands (3)

netiquette (3)

NetUber (5)

Network Softwarization (1)

Networks (2)

New Delhi (3)

New Orleans (1)

Nexus7 (1)

NFV (4)

Niffler (3)

nohup (1)

Norway (1)

NSC (1)

NV (1)

Oatmeal (1)

Obidos (3)

OGCE (1)

OGSA-DAI (18)

OHIF (2)

Omegle (1)

OMII-UK (5)

OMPP (1)

Oneiric Ocelot (1)

Open Office (1)

OpenDaylight (22)

OpenFlow (1)

Operating Systems (5)

Optimizing Compiler (2)

Orthanc (3)

OSGi (1)

PaaS (7)

Panama (2)

Paris (1)

Patches (3)

Paypal (2)

PCI (1)

PdfLatex (1)

Philosophy (4)

Philosophy of Science (1)

PhotoRec (1)

php (1)

pidgin (2)

PixelMed (1)

Portimão (1)

Porto (9)

Portugal (60)

Portuguese (3)

PostgreSQL (3)

PowerPoint (1)

Precise Pangolin (1)

pretty print (1)

Privacy (1)

Psychology (2)

Public Transportation (2)

PubSubHubbub (1)

Python (2)

Qatar (2)

QoS (1)

Rackspace (1)

Radiology (1)

RAPID (1)

ReadTheDocs (1)

RecordMyDesktop (3)

Red Hat (1)

Relational Databases (1)

REST (1)

Reviews (14)

Rhino (1)

Romania (4)

Roomster (1)

Router (1)

RTFW (2)

S3 (1)

SaaS (2)

SAGA (1)

Saudi Arabia (4)

Scam (40)

Scientific Linux (1)

scp (3)

ScreenCasts (6)

screenr (1)

SDCPS (1)

SDDS (1)

SDN (36)

SDNSim (2)

SDS (11)

Seattle (2)

Security (1)

Self help (2)

SELinux (2)

SEO (1)

Serbia (1)

SFC (1)

ShareLatex (1)

SIIM (2)

Simulations (1)

Singapore (2)

Sinhala (1)

Sintra (1)

Skype (4)

Slovakia (2)

SLT (1)

SMART (1)

SOA (4)

social media (21)

SoCPaR2010 (7)

Software Engineering (2)

South Korea (3)

space (1)

Spain (4)

Spam (25)

spamhaus (1)

Spanish (2)

Spark (1)

Sqoop (2)

Sri Lanka (37)

ssh (3)

SSI (1)

stickiness (1)

Stockholm (10)

Stratos (14)

SVN (4)

Sweden (8)

Switzerland (4)

synapse (3)

Systems (4)

Taguspark (1)

Tamil (2)

Taverna (1)

TAVM (3)

TCIA (1)

tcp (1)

telugu (1)

TestDisk (1)

Testing (3)

Themes (2)

thermald (1)

Tickets (1)

Time (10)

Toil (1)

Tomcat (1)

Trac (1)

TransferWise (1)

Translation (1)

Transliteration (1)

Travel2Be (1)

TravelGenio (1)

Travels (176)

TripAdvisor (2)

trojan (1)

TTS (1)

Turkey (2)

twitter (26)

Tyk (5)

Uber (1)

UbuDSL (1)

Ubuntu (12)

UCC (1)

UCC 2014 (1)

uCertify (1)

UI (1)

Unicode (6)

Unicows (1)

UniPlaces (2)

Unity (1)

UoM (8)

UoM_memories (3)

Update Manager (1)

URL Shortener (1)

USA (10)

User-Friendliness (1)

Utqiagvik (4)

Videos (2)

VirtualBox (6)

Virus (2)

Visa (1)

VLDB (1)

VMWare (1)

VPN (1)

WADL (1)

weather (2)

web (4)

Web Services (1)

web-2.0 (5)

WebCam (1)

WebEx (1)

Webinars (4)

wiki (1)

Wikipedia (5)

Windows (5)

Windows API (2)

Windows XP (2)

Wine (1)

Wireshark (1)

WOA (1)

Word Processors (1)

Workflows (1)

World (1)

WS-Messenger (3)

WSDL (1)

WSF/PHP (1)

WSO2 (34)

WSRF (1)

WWW2011 (2)

XaaS (1)

xbaya (7)

xchat-gnome (1)

Xenial Xerus (2)

xOA (1)

xSDN (3)

XVidCap (1)

YaCy (1)

youtube (5)

Zekr (1)

Zoom (2)

zotero (1)