Collecting and archiving tweets: a DataPool case study: Version 1
Collecting and archiving tweets: a DataPool case study: Version 1
  Information presented to a user via Twitter is variously called a ‘stream’, that is, a constant flow of data passing the viewer or reader. Where the totality of information passing through Twitter at any moment is considered, the flow is often referred to as a ‘firehose’, in other words, a gushing torrent of information. Blink and you’ve missed it. But does this information have only momentary value or relevance? Is there additional value in collecting, storing and preserving these data? This short report describes a small case study in archiving collected tweets by, and about, a research data project, DataPool at the University of Southampton. It explains the constraints imposed by Twitter on the use of such collections, describes how a service for collections evolved within these constraints, and illustrates the practical issues and choices that resulted in an archived collection.
  
    University of Southampton
   
  
    
      Hitchcock, Steve
      
        c0b120a1-439e-43c9-9ba6-647e77f40f3c
      
     
  
  
   
  
  
    
      27 March 2013
    
    
  
  
    
      Hitchcock, Steve
      
        c0b120a1-439e-43c9-9ba6-647e77f40f3c
      
     
  
       
    
 
  
    
      
  
  
  
  
  
  
    Hitchcock, Steve
  
  
  
  
   
    (2013)
  
  
    
    Collecting and archiving tweets: a DataPool case study: Version 1
  
  
  
  
  
  
  
  University of Southampton 
  10pp.
  
  
  
  
  
   
  
    
      Record type:
      Monograph
      
      (Project Report)
      
    
   
    
    
      
        
          Abstract
          Information presented to a user via Twitter is variously called a ‘stream’, that is, a constant flow of data passing the viewer or reader. Where the totality of information passing through Twitter at any moment is considered, the flow is often referred to as a ‘firehose’, in other words, a gushing torrent of information. Blink and you’ve missed it. But does this information have only momentary value or relevance? Is there additional value in collecting, storing and preserving these data? This short report describes a small case study in archiving collected tweets by, and about, a research data project, DataPool at the University of Southampton. It explains the constraints imposed by Twitter on the use of such collections, describes how a service for collections evolved within these constraints, and illustrates the practical issues and choices that resulted in an archived collection.
         
      
      
        
          
            
  
    Text
 datapoolcase-tweeps-final.docx
     - Other
   
  
  
 
          
            
          
            
           
            
           
        
          
            
  
    Text
 datapoolcase-tweeps-final.pdf
     - Other
   
  
  
 
          
            
          
            
           
            
           
        
        
       
    
   
  
  
  More information
  
    
      Published date: 27 March 2013
 
    
  
  
    
  
    
  
    
  
    
  
    
     
    
  
    
  
    
  
    
  
  
  
    
  
  
        Identifiers
        Local EPrints ID: 350646
        URI: http://eprints.soton.ac.uk/id/eprint/350646
        
        
        
        
          PURE UUID: 889db4f5-3dc9-4ab7-8c11-d368ca728f21
        
  
    
        
          
        
    
  
  Catalogue record
  Date deposited: 27 Mar 2013 17:09
  Last modified: 14 Mar 2024 13:29
  Export record
  
  
 
 
  
    
    
      Contributors
      
          
          Author:
          
            
            
              Steve Hitchcock
            
          
        
      
      
      
    
  
   
  
    Download statistics
    
      Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
      
      View more statistics