<div dir="ltr">Hi David,<div><br></div><div>Thank you for this. I&#39;ve managed to pin it down to the Symplectic Elements &quot;get_records&quot; script. The userid is NULL, but the actor is </div><div><br></div><div>&quot;/opt/eprints3/bin/get_records&quot;</div><div><br></div><div>Every record with this actor appears to have two revision = 1 entries and is created and immediately destroyed. I&#39;m yet to find an EPrint ID amongst these that links to a real record. The &quot;/usr/sbin/apache2&quot; actor records are behaving as I&#39;d expect.</div><div><br></div><div>I wonder why it&#39;s behaving in this way. Not to worry, we hopefully won&#39;t run out of numbers any time soon! It just means the history table is gigantic and it&#39;s already pretty weighty as it is. I&#39;ll look into it, but we&#39;re hopefully getting RT2 in this lifetime so maybe it won&#39;t be a problem then.</div><div><br></div><div>Thanks,</div><div>James</div><div><br></div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Aug 24, 2020 at 4:10 PM David R Newman &lt;<a href="mailto:drn@ecs.soton.ac.uk">drn@ecs.soton.ac.uk</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">

  
  <div>
    <p>Hi James,</p>
    <p>If you are interacting with a third party application then this
      may explain the large number of &quot;empty&quot; eprint IDs.  With just
      human users it is possible you will still get a few of these where
      a user goes to create a record and then never gets round to
      entering any metadata.  They may then at some point go round and
      delete all their &quot;empty&quot; eprint records to tidy things up. 
      However, the automated creation by a third party app seems more
      likely.  You should get the userid and actor for these history
      records to see if you can see a pattern.</p>
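    <p>As a rough illustration of that check, the sketch below tallies the userid and actor behind eprint IDs that were created and later destroyed. It assumes the history rows have already been exported into Python dictionaries; the key names, the <code>destroy</code> action value and the sample rows are assumptions for illustration, not a confirmed EPrints schema.</p>

```python
from collections import Counter, defaultdict

# Hypothetical sample rows; key names and the "destroy" value are assumptions.
GET_RECORDS = "/opt/eprints3/bin/get_records"
APACHE = "/usr/sbin/apache2"
history_rows = [
    {"datasetid": "eprint", "dataobjid": 1, "action": "create", "userid": 5, "actor": APACHE},
    {"datasetid": "eprint", "dataobjid": 1, "action": "modify", "userid": 5, "actor": APACHE},
    {"datasetid": "eprint", "dataobjid": 2, "action": "create", "userid": None, "actor": GET_RECORDS},
    {"datasetid": "eprint", "dataobjid": 2, "action": "destroy", "userid": None, "actor": GET_RECORDS},
    {"datasetid": "eprint", "dataobjid": 3, "action": "create", "userid": None, "actor": GET_RECORDS},
    {"datasetid": "eprint", "dataobjid": 3, "action": "destroy", "userid": None, "actor": GET_RECORDS},
]


def empty_id_actors(rows):
    """Tally (userid, actor) for eprint ids that were both created and destroyed."""
    actions = defaultdict(set)  # eprint id -> set of actions seen
    creator = {}                # eprint id -> (userid, actor) of the create row
    for row in rows:
        if row["datasetid"] != "eprint":
            continue
        actions[row["dataobjid"]].add(row["action"])
        if row["action"] == "create":
            creator[row["dataobjid"]] = (row["userid"], row["actor"])

    tally = Counter()
    for eprint_id, seen in actions.items():
        if "create" in seen and "destroy" in seen and eprint_id in creator:
            tally[creator[eprint_id]] += 1
    return tally


for (userid, actor), count in empty_id_actors(history_rows).items():
    print(userid, actor, count)
```

    <p>If one actor accounts for nearly all of the created-then-destroyed IDs, that points at an automated process rather than users abandoning records.</p>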
    <p>Regards</p>
    <p>David Newman<br>
    </p>
    <div>On 24/08/2020 16:02, James Kerwin via
      Eprints-tech wrote:<br>
    </div>
    <blockquote type="cite">
      
      <div dir="ltr">Should I be concerned about the number of &quot;empty&quot;
        eprint IDs I see in the history table?
        <div><br>
        </div>
        <div>It appears there are a lot that have two instances of
          &quot;revision = 1&quot; where the record appears to be briefly created
          and then immediately destroyed and the relevant ID is skipped
          over and never used. I am making sure that I only look for
          &quot;datasetid = eprint&quot;.</div>
        <div><br>
        </div>
        <div>I don&#39;t want to get too bogged down with this because it&#39;s
          not the end of the world, but I am tempted to pull on the
          thread and see what&#39;s going on.</div>
        <div><br>
        </div>
        <div>Thanks,</div>
        <div>James<br>
          <div><br>
          </div>
        </div>
      </div>
      <br>
      <div class="gmail_quote">
        <div dir="ltr" class="gmail_attr">On Mon, Aug 24, 2020 at 2:41
          PM James Kerwin via Eprints-tech &lt;<a href="mailto:eprints-tech@ecs.soton.ac.uk" target="_blank">eprints-tech@ecs.soton.ac.uk</a>&gt;
          wrote:<br>
        </div>
        <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
          <div dir="ltr">Ahhhh that&#39;s it! Thank you!<br>
            <div><br>
            </div>
            <div>I&#39;m now slightly embarrassed to say how long I spent
              searching through the various EPrints tables looking for
              this.</div>
            <div><br>
            </div>
            <div>The initial plan is to have a script that looks at how
              many items are put in the repository and breaks them down
              by some time period (e.g. month). If it&#39;s still wanted
              I&#39;ll make it into a button on the admin side that provides
              either a spreadsheet/google docs link or maybe even use it
              as an opportunity to play with graph modules etc.</div>
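            <p>A minimal sketch of that monthly breakdown, assuming the history rows have been exported into Python dictionaries. The key names and the <code>YYYY-MM-DD HH:MM:SS</code> timestamp string are assumptions for illustration and may not match how a given EPrints database stores them.</p>

```python
from collections import Counter

# Hypothetical sample rows; key names and timestamp format are assumptions.
history_rows = [
    {"datasetid": "eprint", "dataobjid": 10, "action": "create", "timestamp": "2020-07-30 10:00:00"},
    {"datasetid": "eprint", "dataobjid": 11, "action": "create", "timestamp": "2020-08-03 09:15:00"},
    {"datasetid": "eprint", "dataobjid": 11, "action": "modify", "timestamp": "2020-08-04 11:00:00"},
    {"datasetid": "eprint", "dataobjid": 12, "action": "create", "timestamp": "2020-08-20 16:45:00"},
    {"datasetid": "user", "dataobjid": 12, "action": "create", "timestamp": "2020-08-21 08:00:00"},
]


def creations_per_month(rows):
    """Count eprint 'create' events per YYYY-MM, counting each eprint id once."""
    seen = set()
    counts = Counter()
    for row in rows:
        if row["datasetid"] != "eprint" or row["action"] != "create":
            continue
        if row["dataobjid"] in seen:  # skip any repeated create row for the same id
            continue
        seen.add(row["dataobjid"])
        counts[row["timestamp"][:7]] += 1  # "YYYY-MM" prefix of the timestamp
    return dict(sorted(counts.items()))


for month, count in creations_per_month(history_rows).items():
    print(month, count)
```

            <p>The per-month dictionary could then be written to a spreadsheet or passed to a graphing module. IDs that were created and never used would need filtering out first if they should not count as deposits.</p>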
            <div><br>
            </div>
            <div>We did look into something similar last year where we
              get the upload date and proper deposit date (as defined by
              EPrints) to assess how long items spent in review. This
              felt a little bit too much like surveilling staff which
              isn&#39;t something I&#39;m on board with so it was quickly
              dropped. &quot;Do no evil...&quot; and so on.</div>
            <div><br>
            </div>
            <div>Thanks,</div>
            <div>James</div>
          </div>
          <br>
          <div class="gmail_quote">
            <div dir="ltr" class="gmail_attr">On Mon, Aug 24, 2020 at
              2:08 PM John Salter &lt;<a href="mailto:J.Salter@leeds.ac.uk" target="_blank">J.Salter@leeds.ac.uk</a>&gt;
              wrote:<br>
            </div>
            <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
              <div lang="EN-GB">
                <div>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">Hi
                      James,<br>
                      The &#39;history&#39; dataset is your friend here!<br>
                      <br>
                    </span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">Are
                      you wanting to do this for a handful of records,
                      or script something?</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)"> </span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">A
                      dataset search along these lines should work:<br>
                      dataset: history</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">datasetid:
                      eprint</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">dataobjid:
                      the eprint id you are interested in</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">action:
                      &#39;create&#39;</span></p>
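                  <p class="MsoNormal">To illustrate what that search returns, here is a small sketch that applies the same three criteria to history rows held as Python dictionaries and picks out the creation time. It is not the EPrints API; the <code>timestamp</code> key, its format and the sample rows are assumptions for illustration.</p>

```python
from datetime import datetime

# Hypothetical sample rows; the "timestamp" key and its format are assumptions.
history_rows = [
    {"datasetid": "eprint", "dataobjid": 101, "action": "create", "timestamp": "2020-08-03 09:15:00"},
    {"datasetid": "eprint", "dataobjid": 101, "action": "modify", "timestamp": "2020-08-04 11:00:00"},
    {"datasetid": "user", "dataobjid": 101, "action": "create", "timestamp": "2019-01-10 08:00:00"},
]


def creation_time(rows, eprint_id):
    """Return the earliest 'create' timestamp for one eprint, or None if absent."""
    stamps = [
        datetime.strptime(row["timestamp"], "%Y-%m-%d %H:%M:%S")
        for row in rows
        if row["datasetid"] == "eprint"
        and row["dataobjid"] == eprint_id
        and row["action"] == "create"
    ]
    return min(stamps) if stamps else None


print(creation_time(history_rows, 101))
```

                  <p class="MsoNormal">Filtering on the dataset ID matters because other datasets (users, for example) can reuse the same numeric object ID.</p>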
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)"> </span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">The
                      history dataset is searchable via the web
                      interface, but for some older versions of EPrints
                      you might want to add the &#39;datasetid&#39; to the
                      search form.</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)"> </span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">Let
                      me know if you need more info!</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">Cheers,</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)">John</span></p>
                  <p class="MsoNormal"><span style="font-size:11pt;font-family:Calibri,sans-serif;color:rgb(31,73,125)"> </span></p>
                  <p class="MsoNormal"><b><span style="font-size:11pt;font-family:Calibri,sans-serif" lang="EN-US">From:</span></b><span style="font-size:11pt;font-family:Calibri,sans-serif" lang="EN-US"> <a href="mailto:eprints-tech-bounces@ecs.soton.ac.uk" target="_blank">eprints-tech-bounces@ecs.soton.ac.uk</a>
                      [mailto:<a href="mailto:eprints-tech-bounces@ecs.soton.ac.uk" target="_blank">eprints-tech-bounces@ecs.soton.ac.uk</a>]
                      <b>On Behalf Of </b>James Kerwin via Eprints-tech<br>
                      <b>Sent:</b> 24 August 2020 13:43<br>
                      <b>To:</b> <a href="mailto:eprints-tech@ecs.soton.ac.uk" target="_blank">eprints-tech@ecs.soton.ac.uk</a><br>
                      <b>Subject:</b> [EP-tech] Date Record Created</span></p>
                  <p class="MsoNormal"> </p>
                  <div>
                    <p class="MsoNormal">Afternoon All,</p>
                    <div>
                      <p class="MsoNormal"> </p>
                    </div>
                    <div>
                      <p class="MsoNormal">In the EPrints database is
                        there data on when a record was created? We have
                        the date an item is deposited which indicates
                        when an item was made live in the repository.
                        The record is created prior to this when a user
                        uploads a file or OA Link through Elements. The
                        record is created in the review buffer.</p>
                    </div>
                    <div>
                      <p class="MsoNormal"> </p>
                    </div>
                    <div>
                      <p class="MsoNormal">When a record is modified
                        there is a &quot;last mod&quot; date and when it goes into
                        the live archive this is treated as the deposit
                        date.</p>
                    </div>
                    <div>
                      <p class="MsoNormal"> </p>
                    </div>
                    <div>
                      <p class="MsoNormal">If not I can find a way to
                        make it happen in future. It would be incredibly
                        helpful if I didn&#39;t need to do this.</p>
                    </div>
                    <div>
                      <p class="MsoNormal"> </p>
                    </div>
                    <div>
                      <p class="MsoNormal">Thanks,</p>
                    </div>
                    <div>
                      <p class="MsoNormal">James</p>
                    </div>
                  </div>
                </div>
              </div>
            </blockquote>
          </div>
          *** Options: <a href="http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech" rel="noreferrer" target="_blank">http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech</a><br>
          *** Archive: <a href="http://www.eprints.org/tech.php/" rel="noreferrer" target="_blank">http://www.eprints.org/tech.php/</a><br>
          *** EPrints community wiki: <a href="http://wiki.eprints.org/" rel="noreferrer" target="_blank">http://wiki.eprints.org/</a></blockquote>
      </div>
      <br>
    </blockquote>
  </div>

</blockquote></div>